(57) Abstract

The invention relates to new expression systems and in particular to an expression system in which a gene of interest is expressed 
at an optimal level. The invention provides a recombinant expression vector comprising a gene of interest and a selectable marker gene, 
wherein the selectable marker gene is arranged downstream of the gene of interest and a stop codon associated with the gene of interest is 
spaced from a start codon of said selectable marker gene at a distance which is sufficient to ensure that translation reinitiation is required 
before said selectable marker protein is expressed from the corresponding mRNA. Examples of such expression systems are retroviral
packaging cell lines and a number of preferred cell lines have been identified.

Expression systems

The present invention relates to new expression systems,
and in particular to expression systems in which a gene of
interest is expressed at an optimal level. Particular
examples of such expression systems are retroviral packaging
cell lines, and a number of preferred cell lines have been
identified.

The ability of eukaryotic and prokaryotic ribosomes to
reinitiate translation at an internal start codon within an
mRNA sequence has previously been recognised. Studies have
been reported in which the efficiency of the process, which
is generally regarded as being low, has been connected with
the length of the intercistronic sequence (Kozak (1987)
Mol. Cell Biol. 7, 3438-3445). Selection of this sequence
or spacer as 70 bp in length, and containing no other start
codons, has been previously reported as being optimal for
reinitiation in a eukaryotic cell line (Cosset F-L.,
Virology (1991) 185, 862).

The applicants have found a way in which the inefficiency
associated with the translation reinitiation process can be
used to good effect.

According to the present invention there is provided a
recombinant expression vector comprising a gene of interest
and a selectable marker gene, wherein the selectable marker
gene is arranged downstream of the gene of interest and a
stop codon associated with the gene of interest is spaced
from a start codon of said selectable marker gene at a
distance which is sufficient to ensure that translation
re-initiation is required before said selectable marker protein
is expressed from the corresponding mRNA.

The invention further provides a process for producing cell
lines in which a gene of interest is expressed, which 
process comprises transforming host cells with an expression 
vector comprising said gene of interest and a selectable 
marker gene, wherein the selectable marker gene is arranged 
downstream of the gene of interest and a stop codon 
associated with the gene of interest is spaced from a start 
codon of said selectable marker gene at a distance which is 
sufficient to ensure that translation re-initiation is 
required before said selectable marker protein is expressed 
from the corresponding mRNA, and selecting those cells where 
expression of the selectable marker gene may be detected.

Since re-initiation of translation is a relatively
inefficient process, the selectable marker
protein will be expressed at lower levels than the product
of the gene of interest. When the marker protein is
expressed at detectable levels, the gene of interest will
therefore be expressed at higher levels. This ensures that, during
the subsequent selection procedure, only those cell clones
which express the gene of interest at higher or optimal
levels will survive. Low-expressing clones will be
eliminated by the selection process.
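
The selection logic can be illustrated with a deliberately simple numerical sketch. It assumes, purely for illustration, that marker expression is a fixed fraction of the expression of the gene of interest. The efficiency value, the threshold, the clone expression levels and the function names below are hypothetical and are not taken from the experiments described herein.

```python
# Toy model (assumption): marker level = re-initiation efficiency x gene-of-interest level.
REINITIATION_EFFICIENCY = 0.05  # hypothetical value, not from the source


def marker_level(goi_level, efficiency=REINITIATION_EFFICIENCY):
    """Marker expression obtained by re-initiation on the same mRNA."""
    return goi_level * efficiency


def surviving_clones(goi_levels, marker_threshold, efficiency=REINITIATION_EFFICIENCY):
    """Keep only clones whose marker level reaches the selection threshold."""
    return [g for g in goi_levels if marker_level(g, efficiency) >= marker_threshold]


# Arbitrary expression units for five imaginary clones.
clones = [10, 50, 200, 400, 1000]
survivors = surviving_clones(clones, marker_threshold=15)
print(survivors)
```

Under these assumptions a clone survives only if its gene-of-interest level is at least the threshold divided by the efficiency, which is the sense in which an inefficient re-initiation step biases selection towards high expressers.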

Cells transformed with the above-described expression 
vectors form a further aspect of the invention. 

The host cells are suitably eukaryotic or prokaryotic host 
cells, preferably eukaryotic host cells. 

The number of nucleotides in the space between the stop
codon of the gene of interest and the start codon of the
selectable marker will suitably be in the range of from
20 to 200 nucleotides, preferably from 60 to 80 nucleotides,
even more preferably from 70 to 80 nucleotides.
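
As a sketch, the nested ranges above can be written as a simple classification. The function name and the treatment of the range limits as inclusive are assumptions made for this illustration only.

```python
def classify_spacer(length_nt):
    """Return the narrowest range stated in the text that contains length_nt.

    Range limits are treated as inclusive (an assumption of this sketch).
    """
    if 70 <= length_nt <= 80:
        return "most preferred (70-80 nt)"
    if 60 <= length_nt <= 80:
        return "preferred (60-80 nt)"
    if 20 <= length_nt <= 200:
        return "suitable (20-200 nt)"
    return "outside stated range"


# 74 and 76 are the spacer lengths quoted for CeB and FbdelPASAF in this document.
for n in (74, 76, 65, 150, 10):
    print(n, classify_spacer(n))
```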



WO 97/08330
PCT/GB96/02061

The vectors used in the process of the invention may be any
of the known types, for example expression plasmids or viral
vectors.

Selected cells may be cultured and, if required, the protein
product of the gene of interest isolated from the culture
using conventional techniques. Alternatively, expression of
the gene of interest may result in other desired effects,
for example, where the gene of interest is included as part
of a viral packaging construct.

Some experimental and clinical gene transfer protocols
require the design of gene transfer vectors suitable for in
vivo gene delivery (Miller, A.D. 1992. Nature 357:455-460).
Retroviral vectors are attractive candidates for
such applications, because they can provide stable gene
transfer and expression (Samarut J. et al., Meth. Enzymol., in
press) and because packaging cells have been designed which
produce non-replication-competent viruses (Miller A.D. (1990)
Hum. Gene Ther. 1:5-14). However, currently available
recombinant retroviruses suffer from a number of drawbacks.

Packaging cell lines provide in trans the retroviral
proteins encoded by the gag, pol and env genes required to
obtain infectious retroviral particles. The gag and pol
products are respectively the structural components of the
virion cores and the replication machinery (enzymes) of the
retroviral particles, whereas the env products are envelope
proteins responsible for the host range of the virions,
for the initiation of infection and for sensitivity to
humoral factors. An ideal packaging cell line should
produce retroviruses that contain only the retroviral vector
genome, and absolutely no replication-competent genomes or
defective genomes encoding some of the viral structural
genes.

A number of packaging cell lines for human gene
transfer have been designed in the past by introducing
plasmid DNAs which contain "helper genomes" encoding gag, 
pol and/or env genes into cells. 

Retroviral packaging cell lines are cells that have been
engineered to provide in trans all the functions required to
express infectious retroviral vectors. A helper genome (or
construct or unit) is herein also referred to as a
"retroviral packaging construct (or unit)", a
"packaging-deficient construct (or genome unit)" or
"gag-pol/env expression plasmids".

Much effort has been made to design strategies to optimize
the helper genomes in order (i) to get the highest
production of retroviral packaging functions (which
correlates with infection titers of retroviral particles)
and (ii) to minimise the chance that the helper genome can
be transmitted via the viral particles (which may lead to
emergence of unwanted retroviral forms).

The first of these packaging cell lines used full length
retroviral genomes as helper genomes that had been crippled
for important cis-regulated replicative functions (reviewed
in Miller, Hum. Gene Ther. (1990) 1:5-14). In order to reduce
the possibility of occurrence of replication-competent
viruses and of transfer of virus structural genes, a second
generation of safer packaging cell lines has been designed
by using two separate and complementary helper genomes which
express either gag-pol or env and are packaging-deficient
(Miller supra).

The cells into which these helper genomes were introduced
were isolated by cotransfecting them with plasmids encoding
selectable markers. However, as no selection was applied on
the packaging-deficient retroviral genome itself, the helper
functions can be lost during the passages of the cells in
culture and the current packaging systems provide limited
titers of infectious retroviral vectors, usually only of the
order of 10 s - 10 s infectious units i.u/ml. Indeed the 
cotransfection with a plasmid encoding a selectable marker
does not directly select the best gag-pol-env-expressing
cells.

The invention further provides a retroviral packaging cell
line comprising a host cell transformed with (i) a
packaging-deficient construct which expresses a viral gag-pol gene and
a first selectable marker gene, and/or (ii) a
packaging-deficient construct which expresses a viral env gene and a
second selectable marker gene; wherein the start codons of the
first and second selectable marker genes are spaced from the stop
codons of the viral gag-pol gene and the viral env gene
respectively by a distance which ensures that reinitiation
of mRNA translation is required for expression of the marker
protein product of said first and/or second selectable
marker gene.

The retroviral packaging cell line may be obtained by the
above-described process, which will involve selecting
transfected cells which express said first and/or second
marker genes.

By using helper constructs which are directly selectable and 
which provide for high expression of the viral gene, high 
titre retroviral vectors may be obtained. 

Helper constructs for use in the process form a further
aspect of the invention. 

The retroviral vectors prepared from the conventional
packaging cell lines are usually not contaminated by
replication-competent retroviruses (RCRs). However,
recombinant amphotropic murine retroviruses have been shown
to arise spontaneously from certain packaging cell lines.
The generation of such RCRs involves recombination at least
between gag-pol/env packaging sequences and vector sequences
(Cosset et al., Virology (1993) 193:385-395).

Recombinant RCRs have been associated with the development
of lymphomas in some severely immunosuppressed monkeys
(Donahue et al., J. Exp Med (1992) 176:1125-1135). In
addition, retroviral vector preparations may also contain,
at low frequencies, retroviruses coding for functional
envelope glycoproteins (Kozak and Kabat, 1990, J. Virol.
64:3500-3508) or for gag-pol proteins. Although the
pathogenicity of these gag-pol or env recombinant
retroviruses is probably low, more evolved recombinant
retroviruses with higher pathogenic potential may occur when
injected in vivo, by recombination and/or complementation of
the initial recombinant viruses with some endogenous
retroviruses.

In a preferred embodiment of the retroviral packaging cell
lines of the invention, the overlapping sequences between
the genomes of the retroviral vector and the helper
construct are reduced, for example as compared to constructs
such as CRIPenv and CRIPAMgag (Danos et al., Proc. Natl.
Acad. Sci USA (1988) 85: 6460-6464). In particular, the viral
sequences in the helper construct are reduced; for example,
not only the packaging sequence but also the 3' Long
Terminal Repeat (LTR), the 3' non-coding sequence and/or the
5'LTR may be eliminated.



The possibility of generation of such RCRs and recombinant
retroviruses can be reduced by reducing the overlapping
sequences between the genomes of both the retroviral vector
and the helper construct.

Conventional retroviral vectors are strongly inactivated by
human serum, which makes them of limited or no use for in
situ gene transfer in gene therapy applications. It has
previously been shown that inactivation by complement in
human serum is controlled by the cell line used to produce
the virions and by viral envelope determinants (Takeuchi et
al., J. Virol (1994) 68:8001-8007). In particular,
inactivation is caused by some properties of the cell lines
that have been used to construct the packaging cells (NIH-3T3)
and also by viral determinants located in the
retroviral envelope (Takeuchi et al., supra). In vivo gene
delivery is an important
goal for a number of human gene therapy strategies.

The applicants have found that certain cell lines form 
preferred packaging cell lines. 

Particularly preferred packaging cell lines are the HT1080
line, the TE671 line, the 3T3 line, the 293 line and the
Mv-1-Lu line. One example of retroviral packaging cells that
will produce complement-resistant virus comprises human
HT1080 cells expressing the RD114 envelope. Such cells form a
preferred aspect of the invention.

Packaging cell lines according to the invention provide
50-100 fold increased titers of retroviral vectors as compared
to conventional packaging cell lines. Retroviral vectors
provided by these new cells are safe, in terms of generation
of RCRs, and considerably more resistant to inactivation by
human complement.

Packaging cell lines according to the invention may be able
to transduce helper-free, human complement-resistant
retroviral vectors at titers consistently higher than 10^7
i.u./ml.

Suitable semi-packaging cell lines in accordance with the
invention are those which express only the gag-pol genes.
Such cell lines may suitably be derived from TE671, mink
Mv-1-Lu, HT1080, 293 or NIH-3T3 cells by introduction of
plasmid CeB (the MoMLV gag-pol expression unit).

Particularly preferred expression vectors in accordance with 
the invention for use in retroviral packaging cell lines are 
those which include MLV gag and pol genes such as CeB. 
Other plasmids may include gag and pol genes from other 
retroviruses or chimeric or mutated gag and pol genes. 

Various viral and retroviral envelope genes may be included
in the plasmids such as MLV-A envelope, GALV envelope, VSV-G
protein, BaEV envelope, RD114 envelope and chimeric or
mutated envelopes. Plasmids which include the RD114 env
gene, such as FBdelPRDSAF as illustrated hereinafter, provide
one example of suitable constructs.

The novel retroviral packaging cells described hereinafter, 
have been designated FLY cells, and may be designed for in 
vivo gene delivery. 

Considerable variations were found between the various cell 
lines screened for their ability to release type C mammalian 
retroviruses. In addition, few cell lines were able to 
produce retroviruses completely resistant to human 
complement. Based on these two criteria, human fibrosarcoma 
HT1080 and rhabdomyosarcoma TE671 cells were selected for
optimum construction of packaging cells.

Other studies have shown the importance of endogenous
retrovirus expression in the generation of recombinant 
retroviruses from retroviral packaging lines (Ronfort et
al., Virology (1995) 207:271-275; Vanin, E.F. et al., J
Virol (1994) 68:4241-4250). The co-packaging of an
endogenous genome and a vector can lead to emergence of
recombinant retroviruses (Vanin et al., supra).
Recombination involves template switching during reverse
transcription of such hybrid retroviruses (Hu et al.,
Science (1990) 250:1227) and homologies between the two
genomes considerably enhance the frequency of reverse
transcriptase jumps (Zhang et al., J. Virol. (1994)
68:2409-2414). Therefore an ideal packaging cell line should
not express endogenous MLV-like (or type C retrovirus-like)
retroviral genomes which can be packaged by type C gag
proteins (Scadden et al., J. Virol. (1990) 64:424-427;
Torrent et al., J. Mol. Biol. (1994) 240:434-444).

Packaging of human endogenous retroviral RNA was not
detected in TELCeB and FLY packaging cells when virion
associated RNA was analysed by RT-PCR using generic primers.

HT1080- and TE671-derived packaging cell lines may be safer
in this respect than those generated from NIH3T3 cells, such
as GP+EAM12 cells, which are known to express and package
sequences related to type C retroviruses (Scadden et al.,
supra).

To generate the FLY packaging cell lines, HT1080 cells were 
transfected with gag-pol and env expression plasmids 
designed to optimise viral protein expression. Direct 
selection for viral gene expression was achieved in 
accordance with the invention by expression of a selectable 
marker gene by re-initiation of translation of the mRNA 
expressing the viral proteins. This strategy resulted in 
packaging cell lines capable of producing extremely high
titer viruses. Furthermore, long-term expression of
packaging functions can be maintained in these cells. Many
unnecessary viral sequences were eliminated from the
packaging constructs to reduce the risk of helper virus
generation; indeed the final packaging cells did not produce
helper virus, in that no replication-competent retrovirus (RCR)
could be detected per 10^7 vector particles.

The FLY packaging cells described herein are safer than, for
example, psiCRIP cells, at least for generation of env
recombinant retroviruses, as is illustrated in Table 4
hereinafter, probably because fewer retroviral sequences
overlapping with the vector are present in the present
env-expression plasmid. Few reports have addressed the question
of the characterization of recombinant retroviruses (RVs)
(Cosset, F.L., et al., Virology (1993) 193:385-395). It is
possible that such RVs could not be detected in previous
packaging cell lines due to lower overall titers. RVs are
defective in normal cell culture conditions but are likely
to evolve to replication-competent viruses if they are
allowed to replicate in cells complementing their expression,
such as co-cultivated packaging cells (Bestwick et al., Proc.
Natl Acad Sci USA (1988) 85:5404-5408; Cosset et al.,
(1993) supra).

In preferred retroviral packaging systems according to the 
invention, RVs are eradicated for example by removal of 
viral LTRs from the packaging construct. 

Consistent with our previous studies (Takeuchi, Y., et al.,
J Virol (1994) 68:8001-8007), LacZ(RD114) and LacZ(MLV-A)
pseudotypes produced from HT1080 and TE671 cells were more
resistant to human complement than LacZ(RD114) or LacZ(MLV-A)
pseudotypes produced by 3T3 or dog cells. It was
therefore decided to use RD114 and MLV-A env genes to
generate recombinant virions with MoMLV cores.

The sequence of the RD114 env gene was determined and is shown
in Figure 4. It was found to be very close to BaEV (baboon
endogenous virus), a type C retrovirus (Benveniste, R.E. et
al., Proc. Natl. Acad. Sci. USA (1973) 70:3316-3320; Kato,
S. et al., Japan. J. Genet. (1987) 62:127-137) with an
envelope gene displaying similarities to the external part
of type D simian retroviruses (SRVs). RD114 uses the SRV
receptor on human cells (Sommerfelt & Weiss, Virology
(1990) 176:58-69; Sommerfelt, M.A. et al., J Virol (1990)
64:6214-6220), making the FLY packaging cells with RD114
envelope capable of generating virions with different
tropism. Retroviral vectors prepared so far for human gene
therapy have used either MLV-A or GALV (gibbon ape leukemia
virus) envelopes, which display some similarities (Battini,
J.L., et al., J Virol. (1992) 66:1468-1475) and which use two
related cell surface receptors for infection (Miller, D.G.
et al., J Virol (1994) 68:8270-8276). Differences in
tissue-specific expression of MLV-A or GALV receptors have been
reported (Kavanaugh et al., Proc Natl Acad Sci USA (1994)
91:7071-7075).

The invention will now be particularly described by way of 
example with reference to the accompanying drawings in 
which: 

Figure 1 illustrates the structure and expression of CeB.
The env gene (XbaI-ClaI) of plasmid pCRIP was removed and
was replaced by coinsertion of the two fragments XbaI-SfiI
(restriction sites underlined) from pOXEnv and a SfiI-ClaI
PCR product containing the bsr selectable marker. This
results in positioning the bsr start codon (shadowed) 74 bp
downstream of the pol stop codon (bold).
Open triangles are start codons (gag and bsr), black
triangles are stop codons (pol and bsr). The shadowed
triangle is the start codon of env, in the same reading
frame as that of bsr. SD and SA are the splice donor and
splice acceptor sites.

Figure 2 illustrates the structure and expression of
FbdelPASAF.

Immediately after the stop codon of env (bold) was inserted
a non-retroviral KasI-NcoI (restriction sites underlined)
linker which positions the phleo start codon (shadowed) 76
bp downstream.

Open triangles are start codons (env and phleo), black
triangles are stop codons (env and phleo). SD and SA are
the splice donor and splice acceptor sites.

Figure 3 illustrates plasmids for expression of Ampho, Eco,
RD114, Xeno, 10A1, GALV, VSV-G and FeLVB envelopes.

All genes are expressed in the same backbone as detailed in
fig. 2. The BglII sites for ecotropic (MoMLV strain),
10A1, xenotropic (NZB.1.V6 strain) and amphotropic (4070A
strain), the NdeI site of RD114 (SC3C strain) and the BamHI site
for both FeLVB and GALV were used as 5' ends, and linked to the
MscI site immediately after the splice donor site in the
leader of FB29 LTR.

Figure 4 shows the sequence of the RD114 env gene (SEQ ID No
1).

Figure 5 shows the genetic structure of gag-pol constructs.

Initiation W and termination (T) codons are shown. The 
thick dotted line below each construct shows MLV-derived 
sequences. Nucleotide positions of MLV-derived sequences are 
shown according to Shinnick et al. (1981) (from nt 1 to nt
6000 with deletion of the packaging signal (DY) from BalI
(nt 215) to PstI (nt 568)), and with some further MoMLV
sequences in both CeB and CeB DS- from nt 7676 to nt 7938.
gag-pol and bsr genes were expressed from the same
transcription unit using either a retroviral promoter
(Mo LTR) or a non-retroviral promoter (hCMV) and a
non-retroviral polyadenylation sequence (polyA). Splice donor
(SD) and acceptor (SA) sites are indicated. The thin line 
denotes retroviral non coding sequences. The thick line 
shows the rabbit beta-1 globin intron B. The position of 
some restriction sites is indicated. 

The nucleic acid sequences of portions of constructs (as
shown in Figure 5 (boxed areas)) are displayed for CeB (SEQ
ID No 2, Figure 6), hCMV+intron (SEQ ID No 3, Figure 7) and
hCMV+intronkaSD (SEQ ID No 4, Figure 8).

The nucleic acid sequences of portions of constructs (as
shown in Figure 3 (boxed areas)) are displayed for
FbdelPASAF (SEQ ID No 5, Figure 9), FbdelPMOSAF (SEQ ID No
6, Figure 10), FbdelPGASAF (SEQ ID No 7, Figure 11),
FbdelPRDSAF (SEQ ID No 8, Figure 12) and CMV10A1 (SEQ ID No
9, Figure 13).

The components of the viral particles are produced by two
independent expression plasmids (gag-pol or env) which also
contain selectable markers (bsr or phleo) expressed from the
same transcriptional units as gag-pol or env (figs. 1 & 2).
The selectable markers are located downstream of the gag-pol or
env genes and there is an optimal distance between the stop
codon of the upstream reading frame and the start codon of
the selectable gene that should allow re-initiation of
translation (Kozak, Mol Cell Biol. (1987) 7:3438-3445).
Because there is no "Kozak" sequence (Kozak, Cell (1986)
44:283-292) of the kind required for normal initiation of
translation of the marker genes, they can only be expressed by
re-initiation of translation after the upstream viral gene has been
successfully expressed. Consequently, and also because
re-initiation of translation is a poorly efficient process,
after transfection of these plasmids, cells resistant to the
drugs corresponding to those selectable genes express high
levels of the viral proteins.

To avoid viral transmission of these "helper" genomes the
constructs used suitably have the classical deletions of
both the packaging sequence located in the leader region and
the 3'LTR, the latter being replaced by SV40
polyadenylation sequences (Figs 1 & 2).

Plasmid CeB is the MoMLV gag-pol expression unit. It
derives from pCRIP, a plasmid used to generate the
constructs introduced in the CRIP and CRE packaging cell
lines (Danos and Mulligan, 1988). As shown in fig. 1, for
generation of plasmid CeB the env gene of pCRIP has been
largely deleted and the bsr selectable marker, encoding a
protein conferring resistance to blasticidin (Izumi et al.,
Experimental Cell Research (1991) 197, 229-233), has been
inserted downstream of the pol gene. There are exactly 74 bp
with no ATG triplets between the stop codon of pol and the
start codon of bsr; this allows its expression by
re-initiation of translation on the gag-pol mRNA, after
translation of the gag-pol reading frame.
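
The two properties relied on here, the spacer length and the absence of ATG triplets within it, can be checked on any candidate sequence. The following is a minimal sketch only: the helper name is hypothetical and the sequence is an invented toy example, not the CeB sequence.

```python
def check_spacer(mrna, stop_end, marker_start):
    """Describe the spacer between an upstream stop codon and a marker start codon.

    mrna         -- transcript written with DNA letters (A, C, G, T)
    stop_end     -- index of the first base after the upstream stop codon
    marker_start -- index of the A of the marker start codon
    Only ATG triplets lying entirely inside the spacer are looked for.
    """
    spacer = mrna[stop_end:marker_start]
    return {
        "length": len(spacer),
        "atg_free": "ATG" not in spacer,
        "marker_starts_with_atg": mrna[marker_start:marker_start + 3] == "ATG",
    }


# Invented toy transcript: a short upstream ORF, a 74-nt spacer, a short marker ORF.
upstream_orf = "ATGGCTTAA"          # ATG GCT TAA
toy_spacer = "GCTC" * 18 + "GC"     # 74 nt, contains no A and hence no ATG
marker_orf = "ATGAAATAA"            # ATG AAA TAA
toy_mrna = upstream_orf + toy_spacer + marker_orf

print(check_spacer(toy_mrna, stop_end=len(upstream_orf),
                   marker_start=len(upstream_orf) + len(toy_spacer)))
```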

FbdelPASAF is a plasmid expressing the amphotropic env gene
and the phleo selectable marker conferring resistance to
phleomycin (Gatignol et al., FEBS Letters (1988) 230:171-175).
By using a PCR-mediated mutagenesis strategy which
modifies the end of the env gene (see fig. 2), a 76 bp linker
was inserted between the stop codon of env and the start
codon of phleo. This allows expression of phleo from the
env mRNA by re-initiation of translation. In addition,
compared to known env-expressing constructs, this strategy
of construction has reduced the length of sequences
overlapping with the ends of conventional retroviral
vectors. The env genes of Mo-MLV, FeLVB, NZB.1V6, 10A1,
GALV and RD114 are expressed by plasmids FBdelPMoSAF,
FBdelPBSAF, FBdelXSAF, FBdelpGSAF, FBdelp10A1SALF and
FBdelPRDSAF, respectively, by using the same backbone as
FBdelPASAF (fig. 3). Retroviral vectors produced with the
RD114 envelope will be useful for in vivo gene delivery since,
in contrast to virions carrying MLV ecotropic or amphotropic
envelopes, virions pseudotyped with RD114 envelopes are not
inactivated by human complement when they are produced by mink
Mv-1-Lu cells or by some human cells (Table 1).

The HT1080 cell line, isolated from a human fibrosarcoma
(ATCC CCL121), and the TE671 cell line, isolated from a human
rhabdomyosarcoma (ATCC CRL 8805) (purchased from ATCC, and
tested for absence of usual cell culture contaminants by
ECACC), have been used for the definitive construction of
packaging cell lines. The HT1080 line was chosen among a panel
of primate and human lines because MLV-A and RD114
efficiently rescued retroviral vectors from these cells and
also because RD114 pseudotypes produced by this cell line
were stable when incubated in human serum. In a standard
assay (Takeuchi et al., J Virol (1994), 68, 8001-8007),
these latter viruses were found to be more than 500 fold more
stable than similar pseudotypes produced in 3T3 cells.

Another advantage of the use of non-murine cells to derive
packaging lines is the absence of MLV-related endogenous
retroviral-like sequences (like VL30 in 3T3 cells) that can
cross-package with MLV-derived retroviral vectors (Torrent
et al., 1994) and generate potentially harmful recombinant
retroviruses.

The helper constructs were introduced into other cell lines
(HT1080 (table 2), mink Mv-1-Lu (table 2), 3T3 (not shown) and
TE671 (table 2)) for the purpose of comparing the
efficiency of the constructs.

As illustrated hereinafter (Table 2), the reverse
transcriptase (RT) activity (provided by expression of the
pol gene) in cells transfected with CeB is significantly
higher than that of the same cells transfected by the
parental plasmid pCRIP or that of cells chronically infected
by MLV. This enhancement of viral gene expression is
correlated with the titers of lacZ retroviral vectors when
an envelope is provided in CeB-lacZ cells, as shown by comparison
with titers of lacZ pseudotypes of either replication-competent
viruses or other helper-free packaging systems.

For the generation of final packaging cell lines, the best
clonal env transfectants have been selected. Packaging
systems obtained in this way will be able to produce helper-free
retroviral vectors at titers greater than 10^8
infectious particles per ml, which would be 10-100 fold
higher than helper-free preparations of others.

Because of the way the selectable markers are expressed (see
above), growing the packaging cells under phleomycin and
blasticidin selective pressure increases and stabilizes the
expression of the retroviral components and particularly the
envelopes, as it is possible that env glycoproteins have
toxic effects for the producer cells in the long term which
may lead to a decrease of expression.

Such an enhancement of viral production observed with the
packaging systems described herein might increase the
emergence of unwanted retroviruses having recombined between
the genomes of both the retroviral vector and either of the
two packaging-deficient constructs. However, the constructs
have been designed in such a way as to reduce the
probability of emergence of recombinant viruses compared to
the parental constructs. To check their safety, attempts
have been made to detect the presence of replication-competent
retroviruses by a mobilisation assay of a lacZ
provirus. No replication-competent viruses have been found in
any of the retroviral vector preparations tested so far.

The following Examples illustrate the invention. 

Example 1 

Preparation of Cell lines and viruses. 
The following cell lines were used: 

A204 (ATCC HTB 82), HeLa (ATCC CCL2), HT1080 (ATCC CCL121),
MRC5 (ATCC CCL171), T24 (ATCC HTB 4), VERO (ATCC CCL81) and
D17 (ATCC CCL183) were purchased from ATCC.

HOS, TE671 and Mv-1-Lu cells and their clones harboring the
MFGnlslacZ retroviral vector were as described by Takeuchi et
al., J Virol (1994), 68, 8001-8007.

The above cell lines were grown in DMEM (Gibco-BRL, U.K.) 
supplemented with 10% fetal calf serum. 

EB8 (Battini et al., J. Virol (1992) 66: 1468-1475);
psiCRE, psiCRELLZ and psiCRIP (Danos et al., Proc. Natl.
Acad. Sci USA (1988) 85: 6460-6464);

GP+EAM12 cells (Markowitz et al., Virology (1988), 167, 400-406);
and

NIH-3T3 murine fibroblasts.

These cell lines were grown in DMEM (GIBCO-BRL, U.K.)
supplemented with 10% new-born calf serum.

Mv-1-Lu, TE671 and HT1080 cells were transfected using the
calcium phosphate precipitation method (Sambrook et al.,
"Molecular Cloning" 1989, Cold Spring Harbor Laboratory
Press: New York) as described elsewhere (Battini et al.,
supra). CeB-transfected Mv-1-Lu, TE671 and HT1080 cells
were selected with 3, 6-8 and 4 µg/ml of blasticidin S (ICN,
UK), respectively, and blasticidin-resistant colonies were
isolated 2-3 weeks later. Cells transfected with the various
env-expression plasmids were selected with phleomycin
(CAYLA, France): 50 µg/ml (for FBASALF-transfected cells) or
10 µg/ml (for FBASAF-, FbdelPASAF-, FbdelPMOSAF-,
FBdelP10A1SAF- or FBdelPRDSAF-transfected cells).
Phleomycin-resistant colonies were isolated 2-3 weeks later.

Production of lacZ pseudotypes using replication-competent
viruses, amphotropic murine leukemia virus (MLV-A) 1504
strain and cat endogenous virus RD114, was carried out as
described previously (Takeuchi et al., J Virol (1994), 68,
8001-8007).

Example 2 

Preparation of Plasmids. 

The env gene of pCRIP (Danos et al., supra) was excised by
HpaI/ClaI digestion. A 500 bp PCR-generated DNA fragment was
obtained using pSV2-bsr (Izumi et al., Experimental Cell
Research (1991), 197, 229-233) as template and a pair of
oligonucleotides:

(5'>CGGAATTCGGATCCGAGCTCGGCCCAGCCGGCCACCATGAAAACATTTAACATTTCTC)
(SEQ ID NO 2) at 5' end and

(5'>GATCCATCGATAAGCTTGGTGGTAAAACTTTT) (SEQ ID NO 3) at 3'
end, with SfiI and ClaI sites, respectively. This fragment
was inserted in HpaI/ClaI sites of pCRIP by co-ligation with
an 85 bp HpaI/SfiI DNA fragment isolated from gOXEnv (Russell
et al., Nucleic Acids Research (1993), 21, 1081-1085) which
provides the end of the Moloney murine leukemia virus
(MoMLV) pol gene. The resulting plasmid, named CeB (Fig. 1),
could express the MoMLV gag-pol gene as well as the bsr
selectable marker conferring resistance to blasticidin S,
both driven by the MoMLV 5' LTR promoter.

A series of env-expression plasmids was generated using the
4070A MLV (amphotropic) env gene (Ott et al., J Virol
(1990), 64, 757-766) and the FB29 Friend MLV promoter
(Perryman et al., Nucleic Acid Res (1991), 19, 6950). In
FBASALF (Fig. 1) a BglII/ClaI fragment containing the env
gene was cloned in BamHI/ClaI sites of plasmid FB3LPh which
also contained the C57 Friend MLV LTR driving the expression
of the phleo selection marker. A 136 bp env fragment was
generated by PCR using plasmid FB3 (Heard et al., J Virol
(1991), 65, 4026-4032) as template and a pair of
oligonucleotides: (5'>GCTCTTCGGACCCTGCATTC) (SEQ ID NO 4) at
5' end (before ClaI site) and
(5'>TAGCATGGCGCCCTATGGCTCGTACTCTATAGGC) (SEQ ID NO 5) at 3'
end, providing a KasI restriction site immediately after the
env stop codon. This PCR fragment was digested using ClaI
and KasI. A DNA fragment containing the FB29 LTR and the
MLV-A env gene was obtained by NdeI/ClaI digestion of
FBASALF. The fragments were co-ligated in NdeI/KasI digested
pUT626 (kindly provided by Daniel Drocourt, CAYLA labs,
France). In the resulting plasmid, named FBASAF (Fig. 1),
the phleo selectable marker was expressed from the same mRNA
as the env gene. A BglII restriction site was created after
the MscI site at position 214 in the FB29 leader by using a
commercial linker (Biolabs, France). An NdeI/BglII fragment
containing the FB29 LTR was co-inserted with the BglII/ClaI
env fragment in NdeI/ClaI-digested FBASAF plasmid DNA,
resulting in plasmid FBdelPASAF (Fig. 1). Compared to
FBASAF, FBdelPASAF has a 100 bp larger deletion in the leader
region.

Example 3

Cloning and sequencing of the RD114 env gene.
The RD114 env gene was first sub-cloned in plasmid
Bluescript KS+ (Stratagene) as a 3 kb HindIII insert
isolated from SC3C, an RD114 infectious DNA clone (Reeves et
al., J. Virol (1984), 52, 164-171). A 2.7 kb ScaI-HindIII
fragment of this subclone containing the RD114 env gene was
sequenced (Figure 4 (SEQ ID NO 1); EMBL accession number
X87829). The 5' non-coding sequence upstream of an NdeI site
was deleted by an EcoRI/NdeI digestion followed by
filling-in with Klenow enzyme and self-ligation. From this
plasmid, two DNA fragments were obtained: a BamHI/NcoI 2.5 kb
fragment and a 63 bp PCR-generated DNA fragment using
(5'>CGCCTCATGGCCTTCATTAA) (SEQ ID NO 6) at 5' end (before
NotI site) and (5'>TAGCATGGCGCCTCAATCCTGAGCTTCTTCC) (SEQ ID
NO 7) at 3' end, providing a KasI restriction site just
after the RD114 env gene stop codon. The PCR fragment was
digested with NcoI and KasI. Both fragments were
co-inserted between BglII and KasI sites of FBdelPASAF and the
resulting plasmid was named FBdelPRDSAF (Fig. 1).
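The primer descriptions in Examples 2 and 3 can be checked directly against the sequences. The following is a minimal sketch in Python, not part of the described method: it takes the 3' primers to be antisense (an assumption consistent with their use as downstream primers), reverse-complements them, and reports the codon lying immediately upstream of the KasI site. The helper names are hypothetical.

```python
# Primer sequences copied from Examples 2 and 3 (written 5' to 3').
PRIMERS = {
    "SEQ ID NO 3": "GATCCATCGATAAGCTTGGTGGTAAAACTTTT",
    "SEQ ID NO 5": "TAGCATGGCGCCCTATGGCTCGTACTCTATAGGC",
    "SEQ ID NO 7": "TAGCATGGCGCCTCAATCCTGAGCTTCTTCC",
}

# Recognition sequences of the enzymes named in the text.
CLAI = "ATCGAT"
KASI = "GGCGCC"  # palindromic, so it reads the same on both strands

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def codon_before_site(antisense_primer: str, site: str) -> str:
    """Return the sense-strand codon immediately upstream of `site`.

    The primer is assumed to be antisense, so it is reverse-complemented
    before the site is searched for.
    """
    sense = reverse_complement(antisense_primer)
    pos = sense.find(site)
    if pos < 3:
        raise ValueError("site not found with a full codon upstream of it")
    return sense[pos - 3:pos]


has_clai = CLAI in PRIMERS["SEQ ID NO 3"]
stop_before_kasi_4070a = codon_before_site(PRIMERS["SEQ ID NO 5"], KASI)
stop_before_kasi_rd114 = codon_before_site(PRIMERS["SEQ ID NO 7"], KASI)
print(has_clai, stop_before_kasi_4070a, stop_before_kasi_rd114)
```

In both 3' primers the codon found directly before the KasI site is a stop codon, which agrees with the statement that the KasI site was placed immediately after the env stop codon.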

Plasmid pCRIPAMgag- (Danos, O. et al., Proc Natl Acad
Sci USA (1988) 85:6460-6464) was used for transfection.

Example 4

Infection assays.

Target cells were seeded in 24-multiwell plates (4x10^4 cells
per well) and were incubated overnight. Infections were then
carried out at 37°C by plating 1 ml dilutions of viral
supernatants in the presence of 4 µg/ml polybrene (Sigma) on
target cells. 3 h later virus-containing medium was replaced
by fresh medium and infected cells were incubated for two
days before X-gal staining, performed as previously
described (Tailor et al., J Virol (1993), 67, 6737-6741;
Takeuchi et al., J Virol (1994), 68, 8001-8007). Viral
titers were determined by counting lacZ-positive colonies as
previously described (Cosset et al., J. Virol. (1990) 64:
1070-1078). Stability of lacZ pseudotypes in fresh human
serum was examined by titrating surviving virus after
incubation in a 1:1 mixture of virus harvest in serum-free
medium and fresh human serum for 1 h at 37°C as described
before (Takeuchi et al., supra).
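By way of illustration only, the conversion of a colony count into a titer can be sketched as follows. The exact counting procedure is that of the cited reference and is not reproduced here; the sketch assumes the usual relation in which the titer is the colony count multiplied by the dilution factor and divided by the inoculum volume. The function name and the example numbers are hypothetical.

```python
def titer_iu_per_ml(colonies: int, dilution_factor: float,
                    volume_ml: float = 1.0) -> float:
    """Estimate a titer in lacZ i.u./ml from one well.

    colonies        -- lacZ-positive colonies counted after X-gal staining
    dilution_factor -- fold dilution of the supernatant (1e5 for 1:100,000)
    volume_ml       -- volume of diluted supernatant plated on the well
    """
    if colonies < 0 or dilution_factor <= 0 or volume_ml <= 0:
        raise ValueError("counts must be >= 0; dilution and volume must be > 0")
    return colonies * dilution_factor / volume_ml


# Hypothetical well: 50 colonies from 1 ml of a 1:100,000 dilution.
example_titer = titer_iu_per_ml(50, 1e5, 1.0)
print(f"{example_titer:.1e} lacZ i.u./ml")
```

Under these assumptions the hypothetical well corresponds to 5x10^6 lacZ i.u./ml.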

Example 5 

Reverse transcriptase (RT) assay. 

RT assays were performed either as described previously
(Takeuchi et al., supra) or using an RT assay kit (Boehringer
Mannheim, U.K.) following the manufacturer's instructions but
using MnCl2 (2 mM) instead of MgCl2.

Example 6 

Screening producer cell lines. 

Viral particles generated with RD114 envelopes have been
found to be more stable in human serum than virions with
MLV-A envelopes, and the producer cell line has also been
found to control sensitivity (Takeuchi et al., supra). A panel
of cell lines was screened for their ability to produce high
titer viruses and for the sensitivity of these virions to
human serum. To do this, cells were infected at high
multiplicity with lacZ pseudotypes of either MLV-A or RD114
and cells producing helper-positive lacZ pseudotypes were
established. Human HT1080 and TE671 and mink Mv-1-Lu cells
were found to release high titer lacZ(RD114) and lacZ(MLV-A)
viruses. LacZ(MLV-A) pseudotypes produced by HT1080 cells
were more resistant to human serum than those produced by
other cells. The titer of these viruses was only four-fold
less following a 1 hr incubation with human serum than a
control incubation (Table 1). LacZ(RD114) pseudotypes
produced by human cells or mink Mv-1-Lu cells were in
general stable in human serum (Table 1). These results
suggested that HT1080, TE671 and Mv-1-Lu cells provided the
best combination of high lacZ titers and resistance to human
serum and they were therefore used for the generation of
retroviral packaging cells.



Table 1. Titer and stability of lacZ pseudotypes.

Producer    LacZ(MLV-A)                LacZ(RD114)
cell        Titer a      Stability b   Titer a      Stability b

A204        650          <3            1,200        105
HeLa        9            nd            2,000        115
HOS         4,500        6             23,000       86
HT1080      2,000,000    26            400,000      129
MRC-5       450          10            1,000        nd
T24         350          nd            1,200        nd
TE671       15,000       2             90,000       38
VERO        260          nd            90           nd
D17         900          <1            200,000      1
Mv-1-Lu     80,000       1             200,000      120

a: titration on TE671 cells as lacZ i.u./ml
b: % of infectivity of human serum-treated viruses compared to
fetal calf serum-treated viruses
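The relation between the stability percentages of Table 1 and the "four-fold" figure quoted in the text can be made explicit with a short calculation. This is a sketch only; the function names are hypothetical and the only datum used is the HT1080 LacZ(MLV-A) stability value of 26 from Table 1.

```python
def stability_percent(titer_human_serum: float, titer_control: float) -> float:
    """Infectivity remaining after human serum, as % of the control."""
    if titer_control <= 0:
        raise ValueError("control titer must be positive")
    return 100.0 * titer_human_serum / titer_control


def fold_reduction(stability_pct: float) -> float:
    """Fold drop in titer corresponding to a stability percentage."""
    if stability_pct <= 0:
        raise ValueError("stability must be positive to express as a fold drop")
    return 100.0 / stability_pct


# HT1080-produced LacZ(MLV-A): stability of 26 % in Table 1.
ht1080_fold = fold_reduction(26)
print(f"{ht1080_fold:.1f}-fold reduction")
```

A stability of 26% corresponds to a reduction of a little under four-fold, in line with the description of the HT1080-produced LacZ(MLV-A) pseudotypes.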



Example 7 

Construction of an improved gag-pol expression vector.
A MoMLV gag-pol expression plasmid, CeB (Fig. 1), was
derived from pCRIP (Danos et al., Proc. Natl. Acad. Sci. USA
(1988) 85: 6460-6464). Approximately 2 kb of env sequence
was removed from pCRIP and the bsr selectable marker,
conferring resistance to blasticidin S (Izumi et al.,
Experimental Cell Research (1991) 197:229-233), was inserted
74 nts downstream of the gag-pol gene. This 74 nts interval
had no ATG triplets and was thought to provide an optimal
distance between the stop codon of the pol reading frame and
the start codon of the bsr gene to allow re-initiation of
translation (Kozak, Mol Cell Biol., 1987, 7: 3438-3445).
There was no "Kozak" consensus sequence (Kozak, Cell, (1986)
44: 283-292) at the 5' end of the marker gene. Therefore,
bsr could only be expressed by re-initiation of translation
after the upstream gag-pol gene had been expressed.
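The two properties stated for the interval (its length, and the absence of ATG triplets) can be checked on any candidate construct. The following sketch uses a toy sequence, not the CeB sequence: the upstream reading frame, the 74-nt spacer and the downstream start are all hypothetical and serve only to show the check.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def intercistronic_spacer(seq: str, upstream_orf_start: int,
                          downstream_atg: int) -> str:
    """Return the sequence between the upstream stop codon and the
    downstream start codon."""
    # Walk the upstream reading frame codon by codon to its first stop.
    for i in range(upstream_orf_start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            stop_end = i + 3
            break
    else:
        raise ValueError("no in-frame stop codon in the upstream frame")
    if seq[downstream_atg:downstream_atg + 3] != "ATG":
        raise ValueError("downstream position is not an ATG")
    if downstream_atg < stop_end:
        raise ValueError("downstream ATG lies inside the upstream frame")
    return seq[stop_end:downstream_atg]


# Toy construct (hypothetical sequences, for illustration only).
upstream = "ATGGCTTAA"                    # ATG GCT TAA
toy_spacer = ("CTGAC" * 15)[:74]          # 74 nt, contains no ATG
downstream = "ATGAAAACATTT"               # start of the marker frame
construct = upstream + toy_spacer + downstream

spacer = intercistronic_spacer(construct, 0, len(upstream) + len(toy_spacer))
print(len(spacer), "ATG" in spacer)
```

For the toy construct the function reports a 74-nt interval free of ATG triplets, which is the arrangement described for the bsr marker in CeB.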

Consequently, after transfection of CeB in
Mv-1-Lu/MFGnlsLacZ (ML), TE671/MFGnlsLacZ (TEL) or HT1080
cells, blasticidin S-resistant bulk populations and most cell
clones expressed high levels of gag-pol proteins, as assessed
by the reverse-transcriptase (RT) activity found in cell
supernatants (Table 2). Considerably higher RT activities
were found in bulk populations of CeB-transfected ML cells
compared to bulk populations of ML cells stably transfected
with the parental pCRIP construct. Similarly the RT
activities of two packaging cell lines generated using the
pCRIPenv- construct, psiCRE cells (Danos et al., supra) and
EB8 cells (Battini et al., supra), were less than that of
CeB-transfected clones (Table 2). Finally, RT activity in
CeB-transfected cell supernatants was higher than that of
cells chronically infected by replication-competent MLV-A
(Table 2).

Table 2. Secreted reverse transcriptase expression

Cell a            RT activity b    LacZ titer c

ML/MLV-A          1                8x10^4
MLSvB             0.1              <1
MLCRIP (bulk)     0.15             nd
MLCeB (bulk)      1.7              nd
MLCeB1            4.2              1x10^6
MLCeB4            1.6              1x10^[illegible]
TEL/MLV-A         3.6              2x10^6
TELCeB6           5.2              4x10^7
HT1080/MLV-A      1.1              1x10^6
HTCeB6            1.9              1x10^6
HTCeB18           2.7              2x10^6
HTCeB22 (FLY)     6.9              5x10^6
HTCeB48           5.5              3x10^[illegible]
EB8               0.22             1x10^4
psiCRE-LLZ        1.2              1x10^[illegible] d

a: ML, Mv-1-Lu cells harboring a MFGnlslacZ provirus; TEL, TE671
cells harboring a MFGnlslacZ provirus; /MLV-A, cells chronically
infected with MLV-A 1504 strain; MLSvB, ML cells transfected with
plasmid pSV2bsr alone; MLCRIP, ML cells co-transfected with pCRIP
and pSV2bsr.
b: Average of arbitrary units relative to ML/MLV-A RT activity of
at least two independent experiments is shown. The standard errors
did not exceed 20% of the values.
c: titration on TE671 cells as lacZ i.u./ml, after polyclonal
transfection of a plasmid which expresses MLV-A env in MLCeB
clones, TELCeB clones, HTCeB clones and EB8 cells; nd, not done.
d: titration on NIH3T3 cells

To rescue infectious lacZ viruses, MLCeB and TELCeB clones
were transfected with FBASALF DNA, a plasmid designed to
express the MLV-A env gene (Fig. 1). Bulk populations of
stable FBASALF transfectants were isolated and supernatants
were titrated using TE671 cells as targets. Titers of lacZ
viruses were higher than those of either MLV-A infected ML or
TEL cells, or FBASALF-transfected EB8 cells (Table 2). These
data suggested that CeB was an extremely efficient MLV
gag-pol expression vector in mink Mv-1-Lu and TE671 cells. CeB
was therefore used to derive packaging cells by transfection
of HT1080 cells. 41/49 blasticidin S-resistant colonies had
detectable levels of RT; 9 had RT activity higher than that
of control MLV-A-infected HT1080 cells (data not shown).
Expression of gag precursor was confirmed in cell lysates
and supernatants of these 9 HTCeB clones by immunoblotting
using antibodies against p30-CA (data not shown). The 4
clones with the highest expression of gag proteins (clones
6, 18, 22 and 48) were infected at high multiplicity with
helper-free lacZ pseudotypes bearing MLV-A envelopes
(MFGnlslacZ(A)) produced by TELCeB6/FBASALF (Table 3) and
then transfected with FBASALF. Supernatants of bulk,
phleomycin-resistant transfectants were assessed for RT
activity and lacZ titer (Table 2). Clone HTCeB22, named FLY,
was found to be the best gag-pol producer clone and was used
to introduce env expression vectors for the generation of
packaging cell lines.

Table 3. Titer following env construct transfection

Producer cell       Env source         Titer a

psiCRIP lacZ 5      pCRIPAMgag-        6x10^4 b
GP+EAM12 lacZ 25    envAM              3x10^5 b
TELCeB6             FBASALF c          5x10^7
                    FBASAF c           2x10^7
                    FBdelPASAF c       2x10^7
TELCeB6             FBdelPASAF 1       3x10^7
                    FBdelPASAF 4       2x10^7
                    FBdelPASAF 6       1x10^7
                    FBdelPASAF 7       5x10^7
                    FBdelPASAF 8       1x10^7
                    FBdelPRDSAF 2      1x10^[illegible]
                    FBdelPRDSAF 4      3x10^[illegible]
                    FBdelPRDSAF 7      1x10^7
                    FBdelPRDSAF 8      2x10^[illegible]
FLY d               FBdelPASAF 1       1x10^[illegible]
                    FBdelPASAF 4       1.5x10^[illegible]
                    FBdelPASAF 5       1x10^[illegible]
                    FBdelPASAF 7       1x10^[illegible]
                    FBdelPASAF 13      7x10^[illegible]
                    FBdelPASAF 14      4x10^[illegible]
                    FBdelPASAF 15      1x10^[illegible]
                    FBdelPASAF 16      5x10^[illegible]
                    FBdelPASAF 17      6x10^[illegible]
FLYA4 lacZ 3        FBdelPASAF 4       2x10^7 b
FLY d               FBdelPRDSAF 1      2.5x10^[illegible]
                    FBdelPRDSAF 2      1x10^7
                    FBdelPRDSAF 6      5x10^6
                    FBdelPRDSAF 10     2x10^[illegible]
                    FBdelPRDSAF 11     3x10^6
                    FBdelPRDSAF 13     1x10^[illegible]
                    FBdelPRDSAF 17     5x10^[illegible]
                    FBdelPRDSAF 18     3x10^7
                    FBdelPRDSAF 19     6x10^[illegible]

Average titers of at least three independent experiments are
shown. The standard errors did not exceed 30% of the titer values.
a: titrated on TE671 cells as lacZ i.u./ml
b: results of best MFGnlslacZ producer clones.
c: bulk populations of env-transfectants in TELCeB6 cells.
d: titration after bulk infection with helper-free MFGnlslacZ.

Example 8 

Construction of env expression vectors.
A series of MLV-A env expression plasmids was then
generated (Fig. 1). In FBASALF, the env gene was inserted
between two Friend-MLV LTRs, its expression driven by the
FB29 MLV LTR (Perryman et al., supra). Most of the packaging
signal located in the leader region was deleted. This
plasmid also expressed the phleo selectable marker (Gatignol
et al., supra) driven by the 3' LTR. FBASAF and FBdelPASAF
were then designed following the same strategy used for CeB.
These two vectors differed only by the extent of deletion of
the packaging signal, FBdelPASAF having virtually no leader
sequence. Compared to the pCRIPAMgag- and pCRIPgag-2 env
plasmids expressed in psiCRIP or psiCRE packaging cells
(Danos et al., supra), about 5 kb of gag-pol sequences was
removed. In addition the 258 bp retroviral sequence
containing the end of the env gene and the beginning of U3
found in pCRIPAMgag- and pCRIPgag-2 was also removed. For both
FBASAF and FBdelPASAF plasmids, the phleo selectable marker
was inserted downstream of the env gene by positioning a 76
nts linker with no ATG codons between the two open reading
frames. Phleo could therefore only be expressed by
re-initiation of translation by the same ribosomal unit that
had expressed the upstream env open reading frame.
FBdelPASAF was also used to generate FBdelPRDSAF, an RD114
envelope expression plasmid (Fig. 1).

After transfection of the env plasmids into TELCeB6 cells
(Table 2), bulk populations of phleomycin-resistant colonies
were isolated and their production of lacZ virus measured
(Table 3). FBASALF gave a titer of 5x10^7 lacZ-i.u./ml, whilst
titers with either FBASAF or FBdelPASAF were 2x10^7
lacZ-i.u./ml (Table 3). Titers of 5x10^7 or 10^7 lacZ-i.u./ml
could be obtained with some FBdelPASAF cell clones or
FBdelPRDSAF clones, respectively.

As FBdelPASAF has minimal virus-derived sequences and was
shown to be the safest construct (see below and Table 4),
it and FBdelPRDSAF were used to generate packaging lines
from FLY cells (clone HTCeB22, Table 2). Envelope expression
of these clones was assayed by interference to challenge
with MFGnlslacZ(A) or MFGnlslacZ(RD) pseudotypes produced by
TELCeB6/FBdelPASAF-7 or TELCeB6/FBdelPRDSAF-7, respectively
(Table 3). The cell lines showing most interference were
cross-infected at high multiplicity with these pseudotypes
to provide MFGnlslacZ proviruses, and supernatants were then
titrated on TE671 cells (Table 3). FLY-FBdelPASAF-13 (FLYA13
packaging line) and FLY-FBdelPRDSAF-18 (FLYRD18 packaging
line) gave the highest productions of lacZ viruses, around
10^7 lacZ-i.u./ml. The best MFGnlslacZ producer clones derived
from either psiCRIP cells (Danos et al., supra) or GP+EAM12
cells (Markowitz et al., supra) gave approximately 50 fold
lower titers (Table 3). The lacZ titers of the FLY-derived
lines shown in Table 3 are lower than the best
TELCeB6-derived lines after transfection of either FBdelPASAF
or FBdelPRDSAF (Table 3). However it should be noted that the
lacZ provirus expressed in TELCeB6 cells was obtained after
clonal selection but was introduced polyclonally in
FLY-derived env-transfected cell clones. When FLY-FBdelPASAF-4
cells (FLYA4 packaging line), infected with helper-free
MFGnlslacZ(RD), were cloned by limiting dilution, the best
clones (e.g. FLYA4lacZ3) were found to produce 20 times more
infectious viruses than the bulk population, reaching the
range of titers obtained with the best TELCeB6-FBdelPASAF
clones (Table 3).

Example 9

Assays for transfer of gag-pol or env functions.
To assay for replication-competent viruses, supernatants
were used to infect TEL cells (a clone of TE671 cells
harboring an MFGnlslacZ provirus). Infected cells were
passaged for 6 days or longer and their supernatants were
used for infection of fresh TE671 cells. No transmission of
lacZ viruses could be detected (Table 4), demonstrating that
the supernatants of pCRIPAMgag--, FBASALF-, FBASAF- or
FBdelPASAF-transfected TELCeB6 cells were helper-free.
A similar absence of replication-competent recombinant
retroviruses was demonstrated using supernatant from a
clone of psiCRIP-MFGnlslacZ cells or from two clones of
FLYA-MFGnlslacZ cells (Table 4).

There have been reports that helper-free retroviral vector
stocks may nevertheless contain recombinant retroviruses
(replication incompetent) carrying either gag-pol or env
genes (Bestwick et al., Proc Natl Acad Sci USA (1988), 85,
5404-5408; Cosset et al., Virology (1993), 193, 385-395;
Girod et al., Virology (1995), in press). To assay for such
recombinant retroviruses, mobilisation of an MFGnlslacZ
provirus from two indicator cell lines which could
cross-complement potential recombinant viruses carrying either
gag-pol or env functional genes was attempted. The TELCeB6
line (Table 2) expressing gag-pol proteins was used as
indicator cell line to test for the presence of env
recombinant (ER) viruses. The TELMOSAF indicator line
expressing MoMLV env glycoproteins (obtained by transfection
of FBMOSAF, a plasmid expressing the MoMLV env gene using
the FBASAF backbone, in TEL cells) was used to detect the
presence of gag-pol recombinant retroviruses (GPR viruses).
After passaging for 4-8 days, the supernatants of the infected
indicator cells were used to infect either human TE671 cells
or murine NIH3T3 cells.

TELCeB6 cells transfected with various env-expressing
constructs, pCRIPAMgag-, FBASAF and FBdelPASAF, were
compared. Although the supernatants of TELCeB6-FBdelPASAF
cells were devoid of replication-competent retroviruses,
they were found sporadically to transfer gag-pol genomes
(Table 4). No GPR viruses could be detected when less than
2x10^5 virions were used to infect the indicator cells.
Similarly TELCeB6 indicator cells infected with various
helper-free viruses were shown sporadically to release lacZ
virions (Table 4). The number depended both on the
env-expression vector used and on the virus input quantity.
Compared to lacZ viruses generated using the pCRIPAMgag-
plasmid, the frequency of detection of the env-recombinant
viruses was lower for supernatants generated by using the
FBASAF and FBdelPASAF constructs (Table 4). For the FBdelPASAF
construct, when less than 5x10^5 MFGnlslacZ(A) helper-free
virions were used to infect the indicator cells, no ER
retroviruses could be detected. From these experiments, it
could be estimated that a supernatant, produced from
TELCeB6-FBdelPASAF cells, containing 1x10^7 infectious units
of MFGnlslacZ retroviral vector contained no
replication-competent virus, and about 100 gag-pol and 100 env
recombinant retroviruses.

Table 4. Transfer of packaging function

Producer cell            Indicator   Input virus a       Detection b
                         cell        (lacZ-i.u.)         ++     +      -

Replication competent virus
psiCRIP lacZ 5           TEL         2x10^[illegible]    0/4    0/4    4/4
TELCeB6-pCRIPAMgag-      TEL         5x10^6              0/4    0/4    4/4
TELCeB6-FBASAF           TEL         5x10^6              0/4    0/4    4/4
TELCeB6-FBdelPASAF       TEL         5x10^6              0/4    0/4    4/4
FLYA4 lacZ 3             TEL         1x10^7              0/4    0/4    4/4
FLYA4 lacZ 7             TEL         1x10^7              0/4    0/4    4/4

Gag-pol recombinant
TELCeB6-FBdelPASAF 7     TELMOSAF    2x10^7              0/4    1/4    3/4
TELCeB6-FBdelPASAF 7     TELMOSAF    2x10^6              0/4    2/4    2/4
TELCeB6-FBdelPASAF 7     TELMOSAF    2x10^5              0/4    2/4    2/4
TELCeB6-FBdelPASAF 7     TELMOSAF    2x10^4              0/4    0/4    4/4

Env recombinant
TELCeB6-pCRIPAMgag-      TELCeB6     5x10^6              2/4    1/4    1/4
TELCeB6-pCRIPAMgag-      TELCeB6     5x10^5              1/4    1/4    2/4
TELCeB6-pCRIPAMgag-      TELCeB6     5x10^4              0/4    2/4    2/4
TELCeB6-FBASAF           TELCeB6     5x10^6              0/4    2/4    2/4
TELCeB6-FBASAF           TELCeB6     5x10^5              0/4    1/4    3/4
TELCeB6-FBASAF           TELCeB6     5x10^4              0/4    1/4    3/4
TELCeB6-FBdelPASAF       TELCeB6     5x10^6              0/4    1/4    3/4
TELCeB6-FBdelPASAF       TELCeB6     5x10^5              1/4    3/4    0/4
TELCeB6-FBdelPASAF       TELCeB6     5x10^4              0/4    0/4    4/4

a: number of lacZ i.u. used to infect indicator cells
b: number of incidences out of four experiments. The ranges of
lacZ titers rescued from infected indicator cells are shown for
each virus input: >100 lacZ i.u./ml (++), 1-100 lacZ i.u./ml (+)
and <1 lacZ i.u./ml (-).

Titers were determined on TE671 cells for replication
competent virus and env recombinant and NIH3T3 cells for
gag-pol recombinant.
Example 10 

In order to confirm resistance to complement and absence of
replication competent virus in our best packaging lines,
MFGnlslacZ(A) and (RD) harvested from FLYA13 and FLYRD18,
respectively, after polyclonal transduction of MFGnlslacZ
(Table 3 above) were tested for stability in fresh human
serum and generation of replication competent virus. Titers
of MFGnlslacZ(RD) from FLYRD18 after 1 hr incubation with 3
independent samples of fresh human serum were 80 to 120%
of control incubations, while titers of MFGnlslacZ(A) from
FLYA13 were 50 to 90% of controls (data not shown). No
replication competent virus was detected in the same assay
described above (Table 4) when 1x10^7 i.u. each of
MFGnlslacZ(A) and (RD) were tested.

Example 11

Generation of plasmids.

CeB plasmid (Fig. 5), expressing the MoMLV gag-pol gene, was
further modified to remove the splice donor site located in
the leader region. A 272 bp fragment was PCR-generated by
using OUSD- (5'-TCTCGCTTCTGTTCGCGCGC) and OLSD-
(5'-TCGATCAAGCTTGCGGCCGCGGTGGTGGGTCGGTGGTCC) as primers and
further digested with BssHII and HindIII. A 1008 bp
HindIII-XhoI fragment isolated from CeB (encompassing a part
of the leader sequence and the beginning of MoMLV gag) and the
PCR fragment were co-inserted into pCeB from which the 1275 bp
BssHII-XhoI fragment (encompassing R-U5-leader-gag) had been
removed. The resulting plasmid, named pCeB DS- (Fig. 5),
bore the deletion of the splice donor (SD) site and a NotI
restriction site created just downstream of the lost SD
site.



A series of gag-pol expression plasmids in which the MoMLV
LTR promoter was replaced by the human cytomegalovirus
immediate early promoter (hCMV promoter) was derived from
both CeB DS- and hCMV-G (Yee et al., 1994 PNAS, 91:
9564-9568), a plasmid used as a source for the hCMV promoter.
A NotI-filled/EcoRI 7260 bp fragment was isolated from CeB DS-
and cloned into hCMV-G which had been opened with SalI
(further rendered blunt-ended) and EcoRI to remove the VSV-G
gene. The resulting plasmid was cut with ClaI and EcoRI
to remove a 1155 bp fragment encompassing sequence derived
from the 3'-LTR and the SV40 polyA sequence and self-ligated
after filling both protruding DNA ends. The resulting plasmid,
named phCMV-intron (Fig. 5), had the gag-pol and bsr ORFs
inserted between the CMV promoter and rabbit beta-globin
polyA post-transcriptional regulatory sequences.

An intermediate plasmid was generated by sub-cloning a 7260
bp EcoRI fragment (isolated from CeB DS-) into hCMVG opened
with EcoRI. A 1155 bp fragment (encompassing sequence
derived from the 3'-LTR and the SV40 polyA sequence) was
removed from this intermediate plasmid which was then
re-circularized by self-ligation after filling both ends.
The resulting plasmid, named phCMV+intron 2P (Fig. 5), was
digested with NotI and the vector was treated with Klenow
enzyme. A 1440 bp fragment (encompassing the hCMV promoter and
rabbit beta-1 globin intron B (Rohrbaugh et al., 1985 Mol.
Cell Biol, 5: 147-160)) was isolated from phCMV+intron 2P by
NotI/EcoRI digestion. This fragment was further treated with
Klenow enzyme and ligated back into the vector. The
resulting plasmid, named hCMV+intron (Fig. 5), could express
gag-pol and bsr genes driven by the hCMV promoter and bore
an intron sequence derived from rabbit beta-1 globin intron
B having both SD and SA (splice acceptor) sites.

A 2450 bp fragment was removed from phCMV+intron 2P by
NotI/XhoI digestion. The resulting vector fragment was then
used to co-ligate a 1330 bp fragment (containing the hCMV
promoter + 5' end of rabbit beta-1 globin intron B (with SD
site)) isolated from phCMVG by ApaI-filled/NotI digestion
and a 1 kb fragment isolated from phCMV+intron 2P by
NotI-filled/XhoI digestion. Compared to phCMV+intron 2P, the
resulting plasmid, named hCMV+SD intron (Fig. 5), had the
deletion of the 3' end of the rabbit beta-1 globin intron B
and thus no SA site in the leader region.

Construct phCMV+leader (Fig. 5) has been described elsewhere
(Savard et al., unpublished). This plasmid, in which gag-pol
and bsr genes were driven by the hCMV promoter, had the
MoMLV SD site in the leader region.

Gag-pol expression.

The different constructs, including the parental CeB
plasmid, were analysed comparatively in a complementation
assay after transfection in TEL-FBdelPASAF cells expressing
4070A-MLV (amphotropic) envelope and harboring a MFGnlslacZ
provirus. The transient production of lacZ retroviruses as
well as the stable production of lacZ retroviral vectors
after selection with blasticidin S were determined (Table
5). All the constructs were able to rescue infectious lacZ
retroviruses, indicating the expression of gag-pol proteins
after transient transfection. Most likely due to the
efficient hCMV and rabbit beta-1 globin intron B
(post)-transcriptional regulatory sequences, hCMV+intron was
particularly potent in transient retroviral vector
production. However, 10 times fewer blasticidin-resistant
colonies were obtained with hCMV+intron compared to
CeB, and stable lacZ virus production from hCMV+intron was
about 5-10 times lower than that of CeB. Clonal examination
of lacZ retrovirus production from blasticidin-resistant
colonies indicated that 80-90% of colonies could express
high levels of gag-pol proteins for both hCMV+intron and CeB
plasmids. In contrast, despite variation in their ability to
form blasticidin-resistant colonies after transfection and
despite their ability to express gag-pol proteins from
transient transfectants, all other constructs had a weak
capacity for rescuing lacZ retroviral vectors from stable
transfectants (Table 5).



Table 5. Comparative study of gag-pol-bsr plasmids.

gag-pol-bsr        Transient        no. clones   Stable              % gag-pol
plasmid            (lacZ i.u./ml)   bsr          (lacZ i.u./ml)      /bsr

CeB                300/ml           50           10^7                90%
CeB DS-            144/ml           5            10^[illegible]      50%
hCMV+intron 2P     ND               20           10^6                50%
hCMV-intron        812/ml           0
hCMV+SD intron     150/ml           1000         10^2                nd
hCMV+leader        328/ml           1000         10^2-10^3           nd
hCMV+intron        12000/ml         5            10^6-10^7           80%



Northern blot analyses were performed on stable
transfectants (blasticidin-resistant) obtained with some of
the gag-pol-bsr plasmids. As expected, the results (not
shown) displayed a correlation between expression of gag-pol
mRNAs and gag-pol protein expression detected by rescue
analysis (Table 5). The CeB construct was found to produce 2-3
fold more gag-pol mRNAs compared to hCMV+intron.
Interestingly, an unexpected 2.45 kb RNA band was found for
the hCMV+intron construct at a ratio of 2:1 compared to the
abundance of the gag-pol mRNA band (at 5.95 kb). Further
investigations using other probes revealed that a cryptic
splice donor (SD) site located in the gag gene (right in
the middle of the CA coding region at position 1596-1597,
numbering according to Shinnick et al., 1981 Nature
(London) 293: 543-548) was activated in this latter
construct. The 2.45 kb RNA species, lacking the 3' half of the
gag gene and most of the pol gene, is unlikely to give rise
to any useful translational product. It is therefore
interesting to notice that the hCMV+intron construct was able
to give rise to slightly more transcripts (gag-pol 5.95 kb
mRNA + 2.45 kb alternative RNA band) compared to gag-pol mRNA
expressed from the CeB construct. Therefore we decided to
inactivate the cryptic SD site in the hCMV+intron construct
in order to increase the ratio of gag-pol mRNAs.
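The comparison of transcript levels can be followed with simple arithmetic. The sketch below is illustrative only: it normalises the hCMV+intron gag-pol mRNA level to 1 (an arbitrary unit chosen here, not a measured value) and applies the 2:1 band ratio and the 2-3 fold difference stated above. The function name is hypothetical.

```python
def total_transcripts(gagpol_mrna: float, cryptic_to_gagpol_ratio: float) -> float:
    """Sum of full-length gag-pol mRNA and the cryptically spliced RNA."""
    return gagpol_mrna * (1.0 + cryptic_to_gagpol_ratio)


# hCMV+intron: gag-pol mRNA normalised to 1 arbitrary unit,
# 2.45 kb band present at 2:1 relative to the 5.95 kb band.
hcmv_gagpol = 1.0
hcmv_total = total_transcripts(hcmv_gagpol, 2.0)

# CeB: 2-3 fold more gag-pol mRNA than hCMV+intron, no cryptic band assumed.
ceb_gagpol_low, ceb_gagpol_high = 2.0 * hcmv_gagpol, 3.0 * hcmv_gagpol

gagpol_fraction = hcmv_gagpol / hcmv_total
print(hcmv_total, (ceb_gagpol_low, ceb_gagpol_high), round(gagpol_fraction, 2))
```

On this normalisation hCMV+intron yields 3 units of transcript in total against 2 to 3 units of gag-pol mRNA for CeB, but only about one third of the hCMV+intron transcripts are full-length gag-pol mRNA, which is the fraction that inactivating the cryptic SD site is intended to raise.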

Assays for transfer of gag-pol functions.

Although the supernatants of packaging cell lines generated
with the CeB gag-pol expression construct were devoid of
replication-competent retroviruses, they were found
sporadically to transfer gag-pol genomes (Example 9, Table
4) (Cosset et al., 1995 J. Virol 69: 7430-7436). Because the
gag-pol-bsr constructs generated here by using the hCMV
promoter had far fewer retroviral sequences homologous to
the retroviral vector than the parental CeB construct (Fig.
5), they are less likely to give rise to gag-pol recombinant
(GPR) viruses. Therefore, the most efficient gag-pol-bsr
plasmids, hCMV+intron and CeB, were further analysed for
emergence of GPR viruses. To assay for such recombinant
retroviruses, we attempted to mobilise a lacZ provirus from
an indicator cell line which could cross-complement
potential recombinant viruses carrying gag-pol functional
genes. Results displayed in Table 6 showed that, consistent
with data reported previously (Example 9, Table 4) (Cosset et
al., 1995, supra), lacZ retrovirus vectors generated by using
the CeB gag-pol construct were contaminated with GPR viruses.
In contrast lacZ retrovirus vectors generated by using the
hCMV+intron construct were completely devoid of such GPR
viruses, suggesting that this construct was improved
compared to CeB with respect to emergence of recombinant
viruses.



Table 6. Comparative study of gag-pol-bsr plasmids.

plasmid        input virus        no. of experiments
               (lacZ i.u.) (a)    giving titres of (b)

CeB            5x10^6             5     3     0
               5x10^5             2     4     2
               5x10^4             0     1     7
hCMV+intron    5x10^6             0     0     8
               5x10^5             0     0     8
               5x10^4             0     0     8

4x10^4 TEL/MOSAF cells in 24-well plates were challenged with lacZ(A) at the
i.u. indicated in the table (a), and incubated at 37°C for 3 days. Cells were
trypsinised and transferred into small flasks. Cell supernatant was harvested
on day 5 after lacZ(A) challenge and plated on either TE671 (not shown) or 3T3
cells (b). No lacZ was mobilised into TE671 at all. LacZ(A) from CMV-int 10
again did not rescue lacZ from TEL/MOSAF.

Example 12

Generic primers to detect D-type (Medstrand and Blomberg,
J. Virol. (1993) 67:6778-6787), C-type (Shih et al., J.
Virol. (1989) 63:64-75) and human endogenous virus RTVL-H
(Wilkinson et al., J. Virol. (1993) 67:2981-2989) sequences
by RT-PCR were employed (Patience et al., supra). Primers to
detect the mouse endogenous VL30 element (Adams et al., Mol.
Cell. Biol. (1988) 8:2989-2998) and MFGnlslacZ RNA were
designed and synthesized (Table X). Overnight supernatants
(in 4 ml of culture medium) from 10^6 cells of
GP+EAM12lacZ25, FLYA4lacZ3 and TELCeB6/FBASALF cells (Table
3) were harvested and centrifuged in a sucrose gradient as
described previously (Patience et al., J. Virol.
70:2654-2657). Fractions containing retrovirus particles
were collected, and RNA was extracted. One twentieth of the
RNA preparation, or dilutions thereof, was applied to RT-PCR
as described previously (Table X). VL30 RNA was detected in
1/200 of the RNA harvested from GP+EAM12lacZ25 cells.
MFGnlslacZ RNA was detected in 1/20 of the RNA from
GP+EAM12lacZ25 and TELCeB6/FBASALF cells and in 1/200 of
the RNA from FLYA4lacZ3 cells.

The primer combinations for RTVL-H, C-type and D-type RNA
did not give detectable PCR product.

Table 7. RT-PCR detection of endogenous retrovirus RNA
associated with virus particles.

                                              RT-PCR of virion-associated RNA from (a)
RNA       primer (5'-3')                      GP+EAM12   FLYA4    TELCeB6/
          forward (F) / reverse (R)           lacZ25     lacZ3    FBASALF

MFGnls    F) CTCTGGCTCACAGTACGACGTAG          +          ++       +
lacZ      R) CCATCAATCCGGTAGGTTTTCCG

C-type    F) CARRGKTTCAARAACWSYCCCAC          -          -        -
          R) AGYARVGTAGCHGGGTTHAGG

D-type    F) TCCCCTTGGAATACTCCTGTTTTYGT       -          -        -
          R) CATTCCTTGTGGTAAAACTTTCCAYTG

RTVL-H    F) CCTCACCCTGATCACRYTTG             NT         -        -
          R) GAATTATGTCTGACAGAAGGG

VL30      F) GTTGACATCTGCAGAGAAAGACC          ++         NT       NT
          R) TCTGAGGTCTGTACACACAATGG

(a) -, not detected; +, detected in 1/20 RNA preparation; ++, detected in
1/200 RNA preparation; NT, not tested because the cells do not possess the
corresponding genes.
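
The C-type, D-type and RTVL-H primers of Table 7 are written with IUPAC degenerate base codes (R, Y, K, W, S, H, V), so each entry stands for a pool of oligonucleotides rather than a single sequence. As a minimal sketch, assuming the standard IUPAC nucleotide code, the size of each pool can be counted as follows (the function name `degeneracy` is illustrative and is not part of the disclosure):

```python
# Standard IUPAC nucleotide codes mapped to the bases each one represents.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Return the number of distinct sequences a degenerate primer represents."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])  # KeyError signals a non-IUPAC character
    return count


# Primer sequences as listed in Table 7 (OCR spacing removed).
primers = {
    "C-type F": "CARRGKTTCAARAACWSYCCCAC",
    "C-type R": "AGYARVGTAGCHGGGTTHAGG",
    "D-type F": "TCCCCTTGGAATACTCCTGTTTTYGT",
    "D-type R": "CATTCCTTGTGGTAAAACTTTCCAYTG",
    "RTVL-H F": "CCTCACCCTGATCACRYTTG",
    "RTVL-H R": "GAATTATGTCTGACAGAAGGG",
}

for name, seq in primers.items():
    print(f"{name}: length {len(seq)}, pool size {degeneracy(seq)}")
```

A pool size of 1 means the primer is a single defined sequence; larger values indicate how many variants the generic primer covers.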

EXAMPLE 13.

Generation of gag-pol pre-packaging cells by using TE671
cells.

CeB, a plasmid designed to over-express MoMLV gag and pol
proteins, was introduced into TE671 human rhabdomyosarcoma
cells (ATCC CRL8805). After selection with blasticidin, 50
bsr-positive colonies were isolated and the RT (reverse
transcriptase) activity in their supernatants was analysed.
Twelve TE671-CeB (TECeB) clones with high RT activity were
selected for further analysis. The best TECeB clone, clone
#15, had an RT activity roughly equivalent to that of TELCeB6
cells (Cosset et al., J. Virol. 69:7430-7436 (1995); see
also Example 7, Table 6 in this patent application) but
displayed 2-3 fold more gag precursors in cells, as
demonstrated in immunoblots by using anti-CA antibodies. The
biological activity of gag-pol proteins expressed in the six
best TECeB clones was further confirmed by their ability to
produce infectious retroviruses in a complementation assay.
A lacZ provirus was introduced into each of the TECeB clones
by polyclonal cross-infection by using lacZ(RD114)
helper-free retrovirus vectors. FBMOSALF, a MoMLV env
expression plasmid (Cosset et al., J. Virol. 69:6314-6322),
was then transfected into each of the TECeB-lacZ lines and
into the TELCeB6 cell line for comparison. After selection
with phleomycin, the titer of lacZ retrovirus vectors was
determined in the supernatant of pools of
phleomycin-resistant colonies for each of the
TECeB-lacZ-FBMOSALF lines. A good correlation was found
between gag-pol expression in the TE-CeB clones (as
determined by RT assays and anti-gag immunoblots) and their
ability to release infectious lacZ particles. TE-CeB15 cells
could release approximately the same number of lacZ
particles as TELCeB6 cells, although TELCeB6 cells had the
advantage of being selected for lacZ expression (Cosset et
al., J. Virol. 69:7430-7436 (1995)). TE-CeB15 cells were
therefore used to derive retroviral packaging cell lines.

Construction of env-expression plasmids.

A series of plasmids (Fig. 3) was designed to allow
expression of different retroviral envelope genes (isolated
from MoMLV, GALV (Gibbon Ape Leukemia Virus) and MLV-10A1).
FBdelPMOSAF (Fig. 3, nucleotide sequence in Fig. 10) and
FBdelP10A1SAF, expressing ecotropic MoMLV or MLV-10A1
envelopes, were generated by replacing the BglII/ClaI
fragment from FBdelPASAF (Cosset et al., J. Virol.
69:7430-7436 (1995); see also Example 7, Fig. 2 and
nucleotide sequence in Fig. 9), encompassing most of the env
gene and splice acceptor site, with that of MoMLV (position
5407 to 7679, Shinnick et al., 1981) or with that of
MLV-10A1 (Ott et al., J. Virol. 64:757-766 (1990)).

Nucleotides 7514-7516 of GALV (Delassus et al., Virology
173:205-213 (1989)) were mutated by PCR-mediated mutagenesis
to create a ClaI site (AAG to CGA), thereby introducing a
conservative modification (a lysine (amino acid 665 of the
GALV env precursor) to an arginine). The BamHI/ClaI fragment
(nts 4994 (Delassus et al., Virology 173:205-213 (1989)) to
7517) was then sub-cloned into FBdelPASAF from which the
BglII/ClaI fragment encompassing most of the env gene and
splice acceptor site had been removed. The resulting
plasmid, expressing GALV envelope glycoproteins, was named
FBdelPGASAF (Fig. 3, nucleotide sequence in Fig. 11).
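
The AAG to CGA substitution described above can be checked against the standard genetic code. The following is a minimal sketch only: the helper name `codon_change` is illustrative, and the ClaI recognition sequence (ATCGAT) is general restriction-enzyme knowledge rather than something stated in this text. The flanking GALV bases are not given here, so the sketch only confirms that CGA supplies the central trinucleotide of the site.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

CLAI_SITE = "ATCGAT"  # assumption: standard ClaI recognition sequence


def codon_change(before: str, after: str) -> tuple[str, str]:
    """Return the one-letter amino acids encoded before and after a codon swap."""
    return CODON_TABLE[before.upper()], CODON_TABLE[after.upper()]


old_aa, new_aa = codon_change("AAG", "CGA")
print(f"AAG -> CGA changes {old_aa} to {new_aa}")
print("CGA lies within the ClaI site:", "CGA" in CLAI_SITE)
```

Lysine (K) and arginine (R) are both basic residues, which is why the text calls the change conservative.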

CMV10A1 was generated by inserting a Klenow enzyme-filled
EagI/SalI fragment from FBdelP10A1SAF (encompassing the 10A1
MLV env gene and phleo selectable marker) into hCMV-G
digested with BamHI and filled with Klenow enzyme. The
resulting plasmid, CMV10A1 (Fig. 3 and nucleotide sequence
in Fig. 13), could express 10A1 envelopes under control of
the hCMV promoter, and the phleo selectable marker by
translation re-initiation.
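
Expression of the marker by translation re-initiation depends on the spacing between the stop codon of the upstream gene and the start codon of the marker (20 to 200 nucleotides, preferably 60 to 80, according to the claims). A minimal sketch of how that spacing could be measured on a construct sequence is given below. The function name, its arguments and the toy sequence are hypothetical and are not taken from the disclosed constructs.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def spacer_report(seq: str, stop_index: int, marker_start_index: int) -> dict:
    """Describe the spacer between an upstream stop codon and a marker ATG.

    stop_index: 0-based index of the first base of the upstream stop codon.
    marker_start_index: 0-based index of the A of the marker's ATG.
    """
    seq = seq.upper()
    stop = seq[stop_index:stop_index + 3]
    start = seq[marker_start_index:marker_start_index + 3]
    if stop not in STOP_CODONS:
        raise ValueError(f"{stop!r} at {stop_index} is not a stop codon")
    if start != "ATG":
        raise ValueError(f"{start!r} at {marker_start_index} is not ATG")
    spacer = seq[stop_index + 3:marker_start_index]
    n = len(spacer)
    return {
        "length": n,
        "in_claimed_range": 20 <= n <= 200,    # range recited in claim 5
        "in_preferred_range": 60 <= n <= 80,   # range recited in claim 6
        "internal_atg": "ATG" in spacer,       # any ATG in the spacer, any frame
    }


# Hypothetical toy construct: end of an upstream gene, a 70-nt spacer, marker start.
upstream_end = "GCCCCCTAA"      # stop codon TAA occupies indices 6-8
toy_spacer = "TCCGC" * 14       # 70 nt, contains no ATG
marker_begin = "ATGAAAACATTT"
toy = upstream_end + toy_spacer + marker_begin

report = spacer_report(
    toy,
    stop_index=6,
    marker_start_index=len(upstream_end) + len(toy_spacer),
)
print(report)
```

The function only reports on the sequence supplied to it; whether a given spacer actually supports re-initiation has to be established experimentally.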

Generation of a multi-tropic set of TE671-based retroviral
packaging lines.

FBdelPRDSAF (Fig. 3, nucleotide sequence in Fig. 12),
FBdelPASAF, FBdelPGASAF, FBdelPMOSAF and FBdelP10A1SAF were
independently introduced into cells of the TE-CeB15
pre-packaging line, expressing MoMLV gag-pol proteins.
Transfected cells were phleomycin-selected and 15-20
phleo-resistant colonies were isolated for each
env-expression plasmid transfected.

Individual colonies were then analysed for expression of
envelope glycoproteins by immunoblots on cell lysates by
using antibodies against RD114 SU glycoproteins, against
Rauscher leukemia virus SU (to screen MoMLV, MLV-4070A and
MLV-10A1 env-producer clones) or against GALV. The best
env-producer colonies as determined in this assay were
further analysed by a complementation assay after
introducing a lacZ retroviral vector. LacZ pseudotypes
released from the different packaging cell lines were
titrated by using NIH 3T3 cells or TE671 cells as targets.
Titers higher than 1x10^7 lacZ i.u./ml were obtained for the
best clones. Depending on the envelope specificities
expressed in these cells, the new TE671-based retroviral
packaging cell lines were named TE-FLYE, TE-FLYA, TE-FLYRD,
TE-FLY10A1 and TE-FLYGA and could express the MoMLV,
MLV-4070A, RD114, MLV-10A1 and GALV env genes, respectively.

Assays for detecting replication-competent retroviruses
(RCRs) were performed on the supernatants of these cells and
were negative (less than 1/ml).

TE671 cells are very potent for transient expression, with
more than 95% of cells expressing the transgene three days
after plasmid transfection (Hatziioannou and Cosset,
unpublished data (1996)). The ability of retroviral
packaging cell lines to produce retroviral vectors
transiently is of crucial importance for gene therapy where
vectors carrying toxic genes have to be prepared. Transient
expression of retroviral vectors was compared between cells
of the TE-FLYA line and of the BING line (Pear et al., Proc.
Natl. Acad. Sci. USA 90, 8392-6 (1993)), a retroviral
packaging cell line designed to express retroviral vectors
transiently. The results (Table 8) showed that TE-FLYA cells
were more efficient for transient expression of a lacZ
retroviral vector, resulting in higher titers.



Table 8. Comparative study of transient production of lacZ
vectors.

packaging      cell          % transfected    transient
cell line      number (a)    cells (b)        titer (c)

BING           281           5.3              2x10^2
TE-FLYA        117           35               1.3x10^3

Cells were transfected with the MFGnlslacZ retroviral vector by the calcium
phosphate precipitation method, and titers of lacZ vectors (c) released in cell
supernatant were determined as lacZ i.u./ml at day 3 following transfection. The
relative number of cells (a) (average per microscope field) and the % of
transfected cells (b) determined after X-gal staining are shown.
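
The figures in Table 8 can be compared directly. As a small worked example (the dictionary layout and the helper name `fold` are illustrative; the numbers are those given in the table), the TE-FLYA to BING ratios for transient titer and for the percentage of transfected cells can be computed as follows:

```python
# Values transcribed from Table 8.
table8 = {
    "BING": {"cells_per_field": 281, "pct_transfected": 5.3, "titer": 2e2},
    "TE-FLYA": {"cells_per_field": 117, "pct_transfected": 35.0, "titer": 1.3e3},
}


def fold(metric: str, numerator: str = "TE-FLYA", denominator: str = "BING") -> float:
    """Ratio of one packaging line to another for a given Table 8 column."""
    return table8[numerator][metric] / table8[denominator][metric]


print(f"titer ratio (TE-FLYA / BING): {fold('titer'):.1f}")
print(f"transfected-cell ratio (TE-FLYA / BING): {fold('pct_transfected'):.1f}")
```

Both ratios depend on a single experiment per line as reported in the table, so they indicate the scale of the difference rather than a precise factor.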

Retroviral vectors prepared from TE671-based packaging cell
lines were analysed for their sensitivity to human
complement-mediated inactivation. Experiments were conducted
as previously described (Cosset et al., J. Virol.
69:7430-7436 (1995); see also Example 10 in this patent
application) by using sera from three individual human
donors (Table 9). As expected, MLV-A prepared from mouse 3T3
cells was highly sensitive to inactivation after 1 hr
incubation with sera. In contrast, titers of lacZ vectors
produced from TE-FLYRD cells were 17 to 55% of control
incubations, while titers of lacZ vectors from TE-FLYA cells
were 1 to 30% of controls.



Table 9. Human serum sensitivity of viruses produced from
TE671-based packaging cell lines.

Virus from:    hu56 (a)        hu57 (a)        BTS (a)

3T3/A          <0.2, <0.2      <0.2, <0.2      <0.2, <0.2
TE-FLYE        15, 7.8         16, 11          48, 60
TE-FLYA        1, 0.6          2.2, 7.1        28, 19
TE-FLYRD       17, 22          30, 44          54, 63

Three fresh human serum samples were tested in duplicate: hu56, hu57 (AB+) and
BTS (AB+). (a) % of control (average for FCS and Opti-MEM treatment) is shown.
CLAIMS:

1. A recombinant expression vector comprising a gene of
interest and a selectable marker gene, wherein the
selectable marker gene is arranged downstream of the
gene of interest and a stop codon associated with the
gene of interest is spaced from a start codon of said
selectable marker gene at a distance which is
sufficient to ensure that said selectable marker
protein is expressed from the corresponding mRNA as a
result of translation reinitiation.

2. A recombinant expression vector according to claim 1 
wherein the vector is a viral vector. 

3. A recombinant expression vector according to claim 2
wherein the vector is a retroviral vector.

4. A recombinant expression vector according to any one
of claims 1 to 3 wherein the gene of interest is
included as part of a viral packaging construct.

5. A recombinant expression vector according to any one 
of the preceding claims wherein the number of 
nucleotides in the space between the stop codon of the 
gene of interest and the start codon of the selectable 
marker is in the range of from 20 to 200 nucleotides. 

6. A recombinant expression vector according to claim 5 
wherein the number of nucleotides in the space between 
the stop codon of the gene of interest and the start 
codon of the selectable marker is in the range of from 
60 to 80 nucleotides. 

7. A process for producing a cell line in which a gene of
interest is expressed, which process comprises:

transforming host cells with an expression vector
according to any one of claims 1 to 6; and

selecting those cells where expression of the
selection marker gene may be detected.

8. A process according to claim 7 wherein the host cell
is a eukaryotic cell.

9. A host cell transformed with a recombinant expression
vector according to any one of claims 1 to 6.

10. A retroviral packaging cell line comprising a host
cell transformed with a first and a second recombinant 
expression vector, said first recombinant expression 
vector having a packaging-deficient construct 
comprising a viral gag-pol gene and a first selectable 
marker gene downstream thereof, and said second 
recombinant expression vector having a packaging- 
deficient construct comprising a viral env gene and a 
second selectable marker gene downstream thereof; 
wherein the start codon of the first and second 
selectable markers are spaced from the stop codons of 
the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that said 
selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation. 

11. A retroviral packaging cell line according to claim 10 
wherein the first selectable marker is a bsr 
selectable marker and the second selectable marker is 
a phleo selectable marker. 

12. A retroviral packaging cell line according to any one
of claims 10 or 11 wherein the packaging-deficient
construct comprising the viral gag-pol gene and first
selectable marker is the CeB (SEQ ID No 2) expression
construct.

13. A retroviral packaging cell line according to any one
of claims 10 or 11 wherein the packaging-deficient
construct comprising the viral env gene and second
selectable marker is the FBdelPASAF (SEQ ID No 5),
the FBdelPMOSAF (SEQ ID No 6), the FBdelPGASAF (SEQ ID
No 7), the FBdelPRDSAF (SEQ ID No 8), the FBdelPXSAF
(Fig. 3), the FBdelP10A1SAF (Fig. 3), or the
FBdelPVSVGSAF (Fig. 3) expression construct.

14. A retroviral packaging cell line according to any one
of claims 10 or 11 wherein the recombinant expression
vector is a packaging-deficient retroviral helper
construct.

15. A retroviral packaging cell line according to claim 14
wherein the overlapping sequences between the genomes
of the retroviral vector and the packaging-deficient
construct are reduced by minimizing the extent of
non-coding retroviral sequences in the
packaging-deficient genome.

16. A retroviral packaging cell line according to any one
of claims 10 to 15 wherein the viral gag-pol gene and
the selectable marker are expressed under the control
of a non-retroviral promoter.

17. A retroviral packaging cell line according to claim 16
wherein the promoter is fused to rabbit beta-1 globin
intron.

18. A retroviral packaging cell line according to claim 16
or claim 17 wherein the promoter is a hCMV promoter.

19. A retroviral packaging cell line according to any one
of claims 16 to 18 wherein the viral gag-pol gene and
the selectable marker is a hCMV+intron (SEQ ID No 3)
or a hCMV+intronkaSD (SEQ ID No 4) expression
construct.

20. A retroviral packaging cell line according to any one
of claims 10 to 15 wherein the viral env gene and the 
selectable marker are under the control of a non- 
retroviral promoter. 

21. A retroviral packaging cell line according to claim 20 
wherein the promoter is fused to rabbit beta-1 globin 
intron. 

22. A retroviral packaging cell line according to claim 20 
or claim 21 wherein the promoter is a hCMV promoter. 

23. A retroviral packaging cell line according to any one
of claims 20 to 22 wherein the viral env gene and the
selectable marker is a CMV10A1 (SEQ ID No 9)
expression construct.

24. A retroviral packaging cell line according to any one
of claims 10 to 23 wherein the cell line is the HT1080
line, the TE671 line, the 3T3 line, the 293 line or
the Mv-1-Lu line.

25. A retroviral packaging cell line according to any one
of claims 10 to 24 wherein the retroviral packaging
cells comprise human HT1080 cells and express RD114
envelopes.

26. A retroviral packaging cell line according to any one
of claims 10 to 24 wherein the retroviral packaging
cells comprise human TE671 cells and express RD114
envelopes.

27. A process for producing a retroviral packaging cell
line in which a gene of interest is expressed, which
process comprises: 

transforming host cells with a first and a second 
recombinant expression vector, said first recombinant 
expression vector having a packaging-deficient 
construct comprising a viral gag-pol gene and a first 
selectable marker gene downstream thereof, and said 
second recombinant expression vector having a 
packaging-deficient construct comprising a viral env 
gene and a second selectable marker gene downstream 
thereof; wherein the start codon of the first and 
second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env 
gene respectively by a distance which ensures that 
said selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation; and 

selecting transformed cells which express said 
first and/or second marker genes. 

28. A packaging deficient construct for use in a process 
according to claim 27, which expresses a viral gag-pol 
gene and a selectable marker wherein a start codon of 
the selectable marker is spaced from a stop codon of 
the viral gag-pol gene by a distance which ensures 
that said selectable marker protein is expressed from 
the corresponding mRNA as a result of translation 
reinitiation. 

29. A packaging deficient construct for use in a process
according to claim 27, which expresses a viral env
gene and a selectable marker gene; wherein a start
codon of the selectable marker is spaced from a stop
codon of the viral env gene by a distance which
ensures that said selectable marker protein is
expressed from the corresponding mRNA as a result of
translation reinitiation.
Elements shown: MLV-LTR, SD, del psi, MoMLV GAG-POL, SA, BSR, poly A.

pol gene... TCT AGA CTG ACA TGG CGC
GTT CAA CGC TCT CAA AAC CCC TTA
AAA ATA AGG TTA ACC CGC GAG GCC
CCC TAA

tccccttaattcttctgatgctcagaggggtcagtac
tgcttcgcccggctccagtgcggcccagccggccacc
ATG AAA ACA TTT AAC ATT TCT... bsr gene

Figure 1. Schematic structure of CeB expression vector
Elements shown: FB29 LTR, SD, del psi, ENV, SA, PHLEO, poly A.

env gene... TAC GAG CCA TAG
ggcgcc tagtgttgacaattaatcatcggcatagtata
cggcatagtataatacgactcactataggagggccacc
ATG GCC AAG TTG ACC... phleo gene

Figure 2. Schematic structure of FBdelPASAF expression vector
Elements drawn: FB29 LTR (FBdel constructs), env gene, phleo, polyA; ClaI and
KasI sites are marked.

construct        env gene         junction sites

FBdelPASAF       4070A MLV env    MscI/BglII
FBdelPMOSAF      Mo MLV env       MscI/BglII
FBdelPRDSAF      RD114 env        MscI/NdeI
FBdelPBSAF       FeLV-B env       MscI/BamHI
FBdelPXSAF       NZB MLV-X env    MscI/BglII
FBdelP10A1SAF    10A1 MLV env     MscI/BglII
FBdelPGASAF      GALV env         MscI/BamHI
FBdelPVSVGSAF    VSV-G            MscI/BglII
CMV10A1SAF       10A1 MLV env     BamHI/EagI

Figure 3. Schematic structure of env expression vectors
NGAGCTCAGGACAGGTAGAAAGAATGAATAGAACAATAAAAGAGACCCTTACTAAATTGA 60
CCTTAGAGACTGGCTTAAAAGATTGGAGACGCCTCCTATCTCTGGCTTTGTTAAGAGCCA 120
GAAATACGCCCAACCGTTTTCGGCTCACCCCATATGAAATCCTTTATGGGGGACCCCCCC 180
CTTTGTCAACCTTGCTCAATTCCTTCTCCCCCTCCGATCCTAAGACTGATTTACAAGCCC 240
GACTAAAAGGGCTGCAAGGCGTGCAGGCCCAAATCTGGACACCCCTGGCCGAATTGTACC 300
GGCCAGGACATCCACAAACTAGCCACCCATTTCAGGTGGGAGACTCCGTGTACGTCCGGC 360
GGCACCGCTCTCAAGGATTGGAGCCTCGTTGGAAGGGACCTTAC ATCGTCCTGCTGACCA 420 
CGCCCACCGCC AT AAAGGTTGACGGGATCGCCGCCTGGATTC AC GC ATCGCACGCCAAGG 480 
CAGCCCCAAAAACCCCTGGACCAGAAACTCCCAAAACCTGGAAGCTCCGCCGTTCGGAGA 540 
ACCCTCTT AAGAT AAGACTCTCCCGTGTCTGACTGCTAATCCACCTTGTCCCTGTACTAA 600 
CCC AAAATGAAACTCCCAACAGGAATGGTCATTTTATGTAGCCT AATAAT AGTTCGGGC A 660 
GGGTTTGACGACCCCCGCAAGGCT ATCGCATTAGTACAAAAACAACATGGTAAACCATGC 720 
GAATGCAGCGGAGGGCAGGTATCCGAGGCCCCACCGAACTCCATCCAACAGGTAACTTGC 780
CC AGGC AAGACGGCCT ACTTAATGACCAACCAAAAATGGAAATGCAGAGTC ACTCC AAAA 840 
ATCTCACCTAGCGGGGGAGAACTCCAGAACTGCCCCTGTAACACTTTCCAGGACTCGATG 900 
CACAGTTCTTGTTATACTGAATACCGGCAATGCAGGCGAATTAATAAGACATACTACACG 960 
GCCACCTTGCTTAAAATACGGTCTGGGAGCCTCAACGAGGTACAGATATTACAAAACCCC 1020 
AATCAGCTCCTACAGTCCCCTTGTAGGGGCTCTATAAATCAGCCCGTTTGCTGGAGTGCC 1080 
ACAGCCCCCATCCATATCTCCGATGGTGGAGGACCCCTCGATACTAAGAGAGTGTGGACA 1140
GTCC AAAAAAGGCTAGAACAAATTC AT AAGGCT ATGACTCCTGAACTTCAATACCACCCC 1200 
TT AGCCCTGCCCAAAGTC AGAGATG ACCTTAGCCTTGATGC ACGGACTTTTGAT ATCCTG 1260 
AATACCACTTTTAGGTTACTCC AGATGTCCAATTTTAGCCTTGCCCAAGATTGTTGGCTC 1320 
TGTTTAAAACTAGGTACCCCTACCCCTCTTGCGATACCCACTCCCTCTTTAACCTACTCC 1380 
CTAGCAGACTCCCTAGCGAATGCCTCCTGTCAGATTATACCTCCCCTCTTGGTTCAACCG 1440 
ATGCAGTTCTCCAACTCGTCCTGTTTATCTTCCCCTTTCATTAACGATACGGAACAAATA 1500 
GACTTAGGTGCAGTCACCTTTACTAACTGCACCTCTGTAGCCAATGTCAGTAGTCCTTTA 1560 
TGTGCCCTAAACGGGTCAGTCTTCCTCTGTGGAAATAACATGGCATACACCTATTTACCC 1620 
CAAAACTGGACCAGACTTTGCGTCCAAGCCTCCCTCCTCCCCGACATTGACATCAACCCG 1680 
GGGGATGAGCCAGTCCCCATTCCTGCCATTGATCATTATATACATAGACCTAAACGAGCT 1740
GTAC AGTTCATCCCTTTACTAGCTGGACTGGGAATCACCGCAGCATTCACCACCGGAGCT 1800 
AC AGGCCTAGGTGTCTCCGTC ACCC AGTAT ACAAAATT ATCCCATC AGTTAATATCTGAT 1860 
GTCCAAGTCTTATCCGGTACCATACAAGATTTACAAGACCAGGTAGACTCGTTAGCTGAA 1920 
G T AGTTC TCC AAAAT AGGAGGGG AC TGGACC T ACTAACGGC AGAAC AAGG AGGAAT TTGT 1980 
TT AGCCTTACAAGAAAAATGCTGTTTTTATGCTAACAAGTCAGGAATTGTGAGAAAC AAA 2040 
ATAAGAACCCTACAAGAAGAATTACAAAAACGCAGGGAAAGCCTGGCAACCAACCCTCTC 2100 
TGGACCGGGCTGCAGGGCTTTCTTCCGTACCTCCT ACCTCTCCTGGGACCCCT ACTC ACC 2160 
CTCCTACTCATACTAACCATTGGGCCATGCGTTTTCAGTCGCCTCATGGCCTTCATTAAT 2220 
G ATAGACTTAATGTTGTAC ATGCCATGGTGCTGGCCC AGCAATACC AAGC ACTC AAAGCT 2280 
G AGGAAGAAGCTC AGGATTGAGCTTCCGGGACAAAAGC AGGGGGGAATGAGAAGTC AGAA 2340 
CCCCCCACCTTTGCTACATAAATAACCGCTTTCATTTCGCTTCTGTAAAACGCTTATGCG 2400 
CCCCACCCTAGCCGGAAAGTCCCCAGCCGCTACGCAACCCGGGCCCCGAGTTGC ATCAGC 2460 
CGTTCGCAACCCGGGCTCCGAGTTGCATCAGCCGAAAGAAACTTC ATTTCCCAAGCTT 2518 



Fig. 4

Figure 6. CeB Sequence

AATGAAAGAC CCCACCTGTA GGTTTGGCAA GCTAGCTTAA GTAACGCCAT TTTGCAAGGC 60

ATGGAAAAAT ACATAACTGA GAATAGAGAA GTTCAGATCA AGGTCAGGAA CAGATGGAAC 120 

AGCTGAATAT GGGCCAAACA GGATATCTGT GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC 180 
AAGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 240

CCCCGGCTCA GGGCCAAGAA CAGATGGTCC CCAGATGCGG TCCAGCCCTC AGCAGTTTCT 300 

AGAGAACCAT CAGATGTTTC CAGGGTGCCC CAAGGACCTG AAATGACCCT GTGCCTTATT 360 

TGAACTAACC AATCAGTTCG CTTCTCGCTT CTGTTCGCGC GCTTCTGCTC CCCGAGCTCA 420 

ATAAAAGAGC CCACAACCCC TCACTCGGGG CGCCAGTCCT CCGATTGACT GAGTCGCCCG 480 

GGTACCCGTG TATCCAATAA ACCCTCTTGC AGTTGCATCC GACTTGTGGT CTCGCTGTTC 540 

CTTGGGAGGG TCTCCTCTGA GTGATTGACT ACCCGTCAGC GGGGGTCTTT CATTTGGGGG 600 

CTCGTCCGGG ATCGGGAGAC CCCTGCCCAG GGACCACCGA CCCACCACCG GGAGGTAAGC 660 

TGGAAGCTTC TGCAGCATCG TTCTGTGTTG TCTCTGTCTG ACTGTGTTTC TGTATTTGTC 720 

TGAGAATATG GGCCAGACTG TTACCACTCC CTTAAGTTTG ACCTTAGGTC ACTGGAAAGA 780 

TGTCGAGCGG ATCGCTCACA ACCAGTCGGT AGATGTCAAG AAGAGACGTT GGGTTACCTT 840 

CTGCTCTGCA GAATGGCCAA CCTTTAACGT CGGATGGCCG CGAGACGGCA CCTTTAACCG 900 

AGACCTCATC ACCCAGGTTA AGATCAAGGT CTTTTCACCT GGCCCGCATG GACACCCAGA 960 

CCAGGTCCCC TACATCGTGA CCTGGGAAGC CTTGGCTTTT GACCCCCCTC CCTGGGTCAA 1020 

GCCCTTTGTA CACCCTAAGC CTCCGCCTCC TCTTCCTCCA TCCGCCCCGT CTCTCCCCCT 1080 

TGAACCTCCT CGTTCGACCC CGCCTCGATC CTCCCTTTAT CCAGCCCTCA CTCCTTCTCT 1140 

AGGCGCCAAA CCTAAACCTC AAGTTCTTTC TGACAGTGGG GGGCCGCTCA TCGACCTACT 1200 

TACAGAAGAC CCCCCGCCTT ATAGGGACCC AAGACCACCC CCTTCCGACA GGGACGGAAA 1260 

TGGTGGAGAA GCGACCCCTG CGGGAGAGGC ACCGGACCCC TCCCCAATGG CATCTCGCCT 1320 

ACGTGGGAGA CGGGAGCCCC CTGTGGCCGA CTCCACTACC TCGCAGGCAT TCCCCCTCCG 1380

CGCAGGAGGA AACGGACAGC TTCAATACTG GCCGTTCTCC TCTTCTGACC TTTACAACTG 1440 

GAAAAATAAT AACCCTTCTT TTTCTGAAGA TCCAGGTAAA CTGACAGCTC TGATCGAGTC 1500 

TGTTCTCATC ACCCATCAGC CCACCTGGGA CGACTGTCAG CAGCTGTTGG GGACTCTGCT 1560 

GACCGGAGAA GAAAAACAAC GGGTGCTCTT AGAGGCTAGA AAGGCGGTGC GGGGCGATGA 1620 

TGGGCGCCCC ACTCAACTGC CCAATGAAGT CGATGCCGCT TTTCCCCTCG AGCGCCCAGA 1680 

CTGGGATTAC ACCACCCAGG CAGGTAGGAA CCACCTAGTC CACTATCGCC AGTTGCTCCT 1740 

AGCGGGTCTC CAAAACGCGG GCAGAAGCCC CACCAATTTG GCCAAGGTAA AAGGAATAAC 1800 

ACAAGGGCCC AATGAGTCTC CCTCGGCCTT CCTAGAGAGA CTTAAGGAAG CCTATCGCAG 1860 

GTACACTCCT TATGACCCTG AGGACCCAGG GCAAGAAACT AATGTGTCTA TGTCTTTCAT 1920 

TTGGCAGTCT GCCCCAGACA TTGGGAGAAA GTTAGAGAGG TTAGAAGATT TAAAAAACAA 1980

GACGCTTGGA GATTTGGTTA GAGAGGCAGA AAAGATCTTT AATAAACGAG AAACCCCGGA 2040 

AGAAAGAGAG GAACGTATCA GGAGAGAAAC AGAGGAAAAA GAAGAACGCC GTAGGACAGA 2100 

GGATGAGCAG AAAGAGAAAG AAAGAGATCG TAGGAGACAT AGAGAGATGA GCAAGCTATT 2160 

GGCCACTGTC GTTAGTGGAC AGAAACAGGA TAGACAGGGA GGAGAACGAA GGAGGTCCCA 2220 

ACTCGATCGC GACCAGTGTG CCTACTGCAA AGAAAAGGGG CACTGGGCTA AAGATTGTCC 2280 

CAAGAAACCA CGAGGACCTC GGGGACCAAG ACCCCAGACC TCCCTCCTGA CCCTAGATGA 2340

CTAGC-GAGGT CAGGGTCAGG AGCCCCCCCC TGAACCCAGG ATAACCCTCA AAGTCGGGGG 2400 

GCAACCCGTC ACCTTCCTGG TAGATACTGG GGCCCAACAC TCCGTGCTGA CCCAAAATCC 2460 

TGGACCCCTA AGTGATAAGT CTGCCTGGGT CCAAGGGGCT ACTGGAGGAA AGCGGTATCG 2520 

CTGGACCACG GATCGCAAAG TACATCTAGC TACCGGTAAG GTCACCCACT CTTTCCTCCA 2580

TGTACCAGAC TGTCCCTATC CTCTGTTAGG AAGAGATTTG CTGACTAAAC TAAAAGCCCA 2640

AATCCACTTT GAGGGATCAG GAGCTCAGGT TATGGGACCA ATGGGGCAGC CCCTGCAAGT 2700 

GTTGACCCTA AATATAGAAG ATGAGCATCG GCTACATGAG ACCTCAAAAG AGCCAGATGT 2760 

TTCTCTAGGG TCCACATGGC TGrCTGATTT TCCTCAGGCC TGGGCGGAAA CCGGGGGCAT 2820 

GGGACTGGCA GTTCGCCAAG CTCCTCTGAT CATACCTCTG AAAGCAACCT CTACCCCCGT 2880 

GTCCATAAAA CAATACCCCA TGTCACAAGA AGCCAGACTG GGGATCAAGC CCCACATACA 2940

GAGACTGTTG GACCAGGGAA TACTGGTACC CTGCCAGTCC CCCTGGAACA CGCCCCTGCT 3000

ACCCGTTAAG AAACCAGGGA CTAATGATTA TAGGCCTGTC CAGGATCTGA GAGAAGTCAA 3060 

CAAGCGGGTG GAAGACATCC ACCCCACCGT GCCCAACCCT TACAACCTCT TGAGCGGGCT 3120 

CCCACCGTGC CACCAGTGGT ACACTGTGCT TGATTTAAAG GATGCCTTTT TCTGCCTGAG 3180 
ACTCCACCCC ACCAGTCAGC CTCTCTTCGC CTTTGAGTGG AGAGATCCAG AGATGGGAAT 3240

CTCAGGACAA TTGACCTGGA CCAGACTCCC ACAGGGTTTC AAAAACAGTC CCACCCTGTT 3300 

TGATGAGGCA CTGCACAGAG ACCTAGCAGA CTTCCGGATC CAGCACCCAG ACTTGATCCT 3360 

GCTACAGTAC GTGGATGACT TACTGCTGGC CGCCACTTCT GAGCTAGACT GCCAACAAGG 3420

TACTCGGGCC CTGTTACAAA CCCTAGGGAA CCTCGGGTAT CGGGCCTCGG CCAAGAAAGC 3480

CCAAATTTGC CAGAAACAGG TCAAGTATCT GGGGTATCTT CTAAAAGAGG GTCAGAGATG 3540 

GCTGACTGAG GCCAGAAAAG AGACTGTGAT GGGGCAGCCT ACTCCGAAGA CCCCTCGACA 3600 

ACTAAGGGAG TTCCTAGGGA CGGCAGGCTT CTGTCGCCTC TGGATCCCTG GGTTTGCAGA 3660 

AATGGCAGCC CCCTTGTACC CTCTCACCAA AACGGGGACT CTGTTTAATT GGGGCCCAGA 3720 

CCAACAAAAG GCCTATCAAG AAATCAAGCA AGCTCTTCTA ACTGCCCCAG CCCTGGGGTT 3780

GCCAGATTTG ACTAAGCCCT TTGAACTCTT TGTCGACGAG AAGCAGGGCT ACGCCAAAGG 3840 

TGTCCTAACG CAAAAACTGG GACCTTGGCG TCGGCCGGTG GCCTACCTGT CCAAAAAGCT 3900 

AGACCCAGTA GCAGCTGGGT GGCCCCCTTG CCTACGGATG GTAGCAGCCA TTGCCGTACT 3960 

GACAAAGGAT GCAGGCAAGC TAACCATGGG ACAGCCACTA GTCATTCTGG CCCCCCATGC 4020 

AGTAGAGGCA CTAGTCAAAC AACCCCCCGA CCGCTGGCTT TCCAACGCCC GGATGACTCA 4080

CTATCAGGCC TTGCTTTTGG ACACGGACCG GGTCCAGTTC GGACCGGTGG TAGCCCTGAA 4140 
CCCGGCTACG CTGCTCCCAC TGCCTGAGGA AGGGCTGCAA GACAACTGCC TTGATATCCT 4200

GGCCGAAGCC CACGGAACCC GACCCGACCT AACGGACCAG CCGCTCCCAG ACGCCGACCA 4260 

CACCTGGTAC ACGGATGGAA GCAGTCTCTT ACAAGAGGGA CAGCGTAAGG CGGGAGCTGC 4320

GGTGACCACC GAGACCGAGG TAATCTGGGC TAAAGCCCTG CCAGCCGGGA CATCCGCTCA 4380 

GCGGGCTGAA CTGATAGCAC TCACCCAGGC CCTAAAGATG GCAGAAGGTA AGAAGCTAAA 4440 

TGTTTATACT GATAGCCGTT ATGCTTTTGC TACTGCCCAT ATCCATGGAG AAATATACAG 4500 

AAGGCGTGGG TTGCTCACAT CAGAAGGCAA AGAGATCAAA AATAAAGACG AGATCTTGGC 4560 

CCTACTAAAA GCCCTCTTTC TGCCCAAAAG ACTTAGCATA ATCCATTGTC CAGGACATCA 4620 

AAAGGGACAC AGCGCCGAGG CTAGAGGCAA CCGGATGGCT GACCAAGCGG CCCGAAAGGC 4680 

AGCCATCACA GAGACTCCAG ACACCTCTAC CCTCCTCATA GAAAATTCAT CACCCTACAC 4740 

CTCAGAACAT TTTCATTACA CAGTGACTGA TATAAAGGAC CTAACCAAGT TGGGGGCCAT 4800 

TTATGATAAA ACAAAGAAGT ATTGGGTCTA CCAAGGAAAA CCTGTGATGC CTGACCAGTT 4860 

TACTTTTGAA TTATTAGACT TTCTTCATCA GCTGACTCAC CTCAGCTTCT CAAAAATGAA 4920

GGCTCTCCTA GAGAGAAGCC ACAGTCCCTA CTACATGCTG AACCGGGATC GAACACTCAA 4980 

AAATATCACT GAGACCTGCA AAGCTTGTGC ACAAGTCAAC GCCAGCAAGT CTGCCGTTAA 5040 

ACAGGGAACT AGGGTCCGCG GGCATCGGCC CGGCACTCAT TGGGAGATCG ATTTCACCGA 5100 

GATAAAGCCC GGATTGTATG GCTATAAATA TCTTCTAGTT TTTATAGATA CCTTTTCTGG 5160 

CTGGATAGAA GCCTTCCCAA CCAAGAAAGA AACCGCCAAG GTCGTAACCA AGAAGCTACT 5220 

AGAGGAGATC TTCCCCAGGT TCGGCATGCC TCAGGTATTG GGAACTGACA ATGGGCCTGC 5280 

CTTCGTCTCC AAGGTGAGTC AGACAGTGGC CGATCTGTTG GGGATTGATT GGAAATTACA 5340 

TTGTGCATAC AGACCCCAAA GCTCAGGCCA GGTAGAAAGA ATGAATAGAA CCATCAAGGA 5400 

GACTTTAACT AAATTAACGC TTGCAACTGG CTCTAGAGAC TGGGTGCTCC TACTCCCCTT 5460 

AGCCCTGTAC CGAGCCCGCA ACACGCCGGG CCCCCATGGC CTCACCCCAT ATGAGATCTT 5520 

ATATGGGGCA CCCCCGCCCC TTGTAAACTT CCCTGACCCT GACATGACAA GAGTTACTAA 5580 

CAGCCCCTCT CTCCAAGCTC ACTTACAGGC TCTCTACTTA GTCCAGCACG AAGTCTGGAG 5640 

ACCTCTGGCG GCAGCCTACC AAGAACAACT GGACCGACCG GTGGTACCTC ACCCTTACCG 5700 

AGTCGGCGAC ACAGTGTGGG TCCGCCGACA CCAGACTAAG AACCTAGAAC CTCGCTGGAA 5760 

AGGACCTTAC ACAGTCCTGC TGACCACCCC CACCGCCCTC AAAGTAGACG GCATCGCAGC 5820 

TTGGATACAC GCCGCCCACG TGAAGGCTGC CGACCCCGGG GGTGGACCAT CCTCTAGACT 5880 

GACATGGCGC GTTCAACGCT CTCAAAACCC CTTAAAAATA AGGTTAACCC GCGAGGCCCC 5940 

CTAATCCCCT TAATTCTTCT GATGCTCAGA GGGGTCAGTA CTGCTTCGCC CGGCTCCAGT 6000 

GCGGCCCAGC CGGCCACCAT GAAAACATTT AACATTTCTC AACAAGATCT AGAATTAGTA 6060 

GAAGTAGCGA CAGAGAAGAT TACAATGCTT TATGAGGATA ATAAACATCA TGTGGGAGCG 6120 

GCAATTCGTA CGAAAACAGG AGAAATCATT TCGGCAGTAC ATATTGAAGC GTATATAGGA 6180

CGAGTAACTG TTTGTGCAGA AGCCATTGCG ATTGGTAGTG CAGTTTCGAA TGGACAAAAG 6240 

GATTTTGACA CGATTGTAGC TGTTAGACAC CCTTATTCTG ACGAAGTAGA TAGAAGTATT 6300 

CGAGTGGTAA GTCCTTGTGG TATGTGTAGG GAGTTGATTT CAGACTATGC ACCAGATTGT 6360

TTTGTGTTAA TAGAAATGAA TGGCAAGTTA GTCAAAACTA CGATTGAAGA ACTCATTCCA 6420 

CTCAAATATA CCCGAAATTA AAAGTTTTAC CACCAAGCTT ATCGATTAGT CCAATTTGTT 6480 

AAAGACAGGA TATCAGTGGT CCAGGCTCTA GTTTTGACTC AACAATATCA CCAGCTGAAG 6540 

CCTATAGAGT ACGAGCCATA GATAAAATAA AAGATTTTAT TTAGTCTCCA GAAAAAGGGG 6600 

GGAATGAAAG ACCCCACCTG TAGGTTTGGC AAGCTAGCTT AAGTAACGCC ATTTTGCAAG 6660 

GCATGGAAAA ATACATAACT GAGAATAGAG AAGTTCAGAT CAAGGTCAGG AACAGATGGA 6720

ACAGTCGAGA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 6780 

AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 6840 

AATGTATCTT ATCATGTCTG GATCCCCAGG AAGCTCCTCT GTGTCCTCAT AAACCCTAAC 6900 

CTCCTCTACT TGAGAGGACA TTCCAATCAT AGGCTGCCCA TCCACCCTCT GTGTCCTCCT 6960 

GTTAATTAGG TCACTTAACA AAAAGGAAAT TGGGTAGGGG TTTTTCACAG ACCGCTTTCT 7020 

AAGGGTAATT TTAAAATATC TGGGAAGTCC CTTCCACTGC TGTGTTCCAG AAGTGTTGGT 7080 

AAACAGCCCA CAAATGTCAA CAGCAGAAAC ATACAAGCTG TCAGCTTTGC ACAAGGGCCC 7140 

AACACCCTGC TCATCAAGAA GCACTGTGGT TGCTGTGTTA GTAATGTGCA AAACAGGAGG 7200 

CACATTTTCC CCACCTGTGT AGGTTCCAAA ATATCTAGTG TTTTCATTTT TACTTGGATC 7260 

AGGAACCCAG CACTCCACTG GATAAGCATT ATCCTTATCC AAAACAGCCT TGTGGTCAGT 7320 

GTTCATCTGC TGACTGTCAA CTGTAGCATT TTTTGGGGTT ACAGTTTGAG CAGGATATTT 7380 

GGTCCTGTAG TTTGCTAACA CACCCTGCAG CTCCAAAGGT TCCCCACCAA CAGCAAAAAA 7440

ATGAAAATTT GACCCTTGAA TGGGTTTTCC AGCACCATTT TCATGAGTTT TTTGTGTCCC 7500 

TGAATGCAAG TTTAACATAG CAGTTACCCC AATAACCTCA GTTTTAACAG TAACAGCTTC 7560 

CCACATCAAA ATATTTCCAC AGGTTAAGTC CTGATTTAAA TTAGGCAAAG GAATTC 7616 
AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 60
AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT 120 
TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 180 
CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 240 
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 300 
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 360 
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 420 
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 480 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 540 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 600 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 660 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 720 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 780 
CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 840 
GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 900
CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 960 

TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 1020 

CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 1080 

TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 1140 

AGATTGTAAG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 1200 

TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAA TGATAAGGTA GAATATTTCT 1260 

GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 1320 

CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 1380 

ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 1440 

ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGAATTGGCC 1500 

GCAAGCTTCT GCAGCATCGT TCTGTGTTGT CTCTGTCTGA CTGTGTTTCT GTATTTGTCT 1560 

GAGAATATGG GCCAGACTGT TACCACTCCC TTAAGTTTGA CCTTAGGTCA CTGGAAAGAT 1620 

GTCGAGCGGA TCGCTCACAA CCAGTCGGTA GATGTCAAGA AGAGACGTTG GGTTACCTTC 1680 

TGCTCTGCAG AATGGCCAAC CTTTAACGTC GGATGGCCGC GAGACGGCAC CTTTAACCGA 1740 

GACCTCATCA CCCAGGTTAA GATCAAGGTC TTTTCACCTG GCCCGCATGG ACACCCAGAC 1800

CAGGTCCCCT ACATCGTGAC CTGGGAAGCC TTGGCTTTTG ACCCCCCTCC CTGGGTCAAG 1860

CCCTTTGTAC ACCCTAAGCC TCCGCCTCCT CTTCCTCCAT CCGCCCCGTC TCTCCCCCTT 1920 

GAACCTCCTC GTTCGACCCC GCCTCGATCC TCCCTTTATC CAGCCCTCAC TCCTTCTCTA 1980 

GGCGCCAAAC CTAAACCTCA AGTTCTTTCT GACAGTGGGG GGCCGCTCAT CGACCTACTT 2040

ACAGAAGACC CCCCGCCTTA TAGGGACCCA AGACCACCCC CTTCCGACAG GGACGGAAAT 2100 

GGTGGAGAAG CGACCCCTGC GGGAGAGGCA CCGGACCCCT CCCCAATGGC ATCTCGCCTA 2160 

CGTGGGAGAC GGGAGCCCCC TGTGGCCGAC TCCACTACCT CGCAGGCATT CCCCCTCCGC 2220 

GCAGGAGGAA ACGGACAGCT TCAATACTGG CCGTTCTCCT CTTCTGACCT TTACAACTGG 2280 

AAAAATAATA ACCCTTCTTT TTCTGAAGAT CCAGGTAAAC TGACAGCTCT GATCGAGTCT 2340 

GTTCTCATCA CCCATCAGCC CACCTGGGAC GACTGTCAGC AGCTGTTGGG GACTCTGCTG 2400 

ACCGGAGAAG AAAAACAACG GGTGCTCTTA GAGGCTAGAA AGGCGGTGCG GGGCGATGAT 2460 

GGGCGCCCCA CTCAACTGCC CAATGAAGTC GATGCCGCTT TTCCCCTCGA GCGCCCAGAC 2520 

TGGGATTACA CCACCCAGGC AGGTAGGAAC CACCTAGTCC ACTATCGCCA GTTGCTCCTA 2580 

GCGGGTCTCC AAAACGCGGG CAGAAGCCCC ACCAATTTGG CCAAGGTAAA AGGAATAACA 2640 

CAAGGGCCCA ATGAGTCTCC CTCGGCCTTC CTAGAGAGAC TTAAGGAAGC CTATCGCAGG 2700 

TACACTCCTT ATGACCCTGA GGACCCAGGG CAAGAAACTA ATGTGTCTAT GTCTTTCATT 2760 

TGGCAGTCTG CCCCAGACAT TGGGAGAAAG TTAGAGAGGT TAGAAGATTT AAAAAACAAG 2820 

ACGCTTGGAG ATTTGGTTAG AGAGGCAGAA AAGATCTTTA ATAAACGAGA AACCCCGGAA 2880 

GAAAGAGAGG AACGTATCAG GAGAGAAACA GAGGAAAAAG AAGAACGCCG TAGGACAGAG 2940 

GATGAGCAGA AAGAGAAAGA AAGAGATCGT AGGAGACATA GAGAGATGAG CAAGCTATTG 3000

GCCACTGTCG TTAGTGGACA GAAACAGGAT AGACAGGGAG GAGAACGAAG GAGGTCCCAA 3060 

CTCGATCGCG ACCAGTGTGC CTACTGCAAA GAAAAGGGGC ACTGGGCTAA AGATTGTCCC 3120 

AAGAAACCAC GAGGACCTCG GGGACCAAGA CCCCAGACCT CCCTCCTGAC CCTAGATGAC 3180 

TAGGGAGGTC AGGGTCAGGA GCCCCCCCCT GAACCCAGGA TAACCCTCAA AGTCGGGGGG 3240 

CAACCCGTCA CCTTCCTGGT AGATACTGGG GCCCAACACT CCGTGCTGAC CCAAAATCCT 3300 

GGACCCCTAA GTGATAAGTC TGCCTGGGTC CAAGGGGCTA CTGGAGGAAA GCGGTATCGC 3360 

TGGACCACGG ATCGCAAAGT ACATCTAGCT ACCGGTAAGG TCACCCACTC TTTCCTCCAT 3420 

GTACCAGACT GTCCCTATCC TCTGTTAGGA AGAGATTTGC TGACTAAACT AAAAGCCCAA 3480 

ATCCACTTTG AGGGATCAGG AGCTCAGGTT ATGGGACCAA TGGGGCAGCC CCTGCAAGTG 3540 

TTGACCCTAA ATATAGAAGA TGAGCATCGG CTACATGAGA CCTCAAAAGA GCCAGATGTT 3600

TCTCTAGGGT CCACATGGCT GTCTGATTTT CCTCAGGCCT GGGCGGAAAC CGGGGGCATG 3660

GGACTGGCAG TTCGCCAAGC TCCTCTGATC ATACCTCTGA AAGCAACCTC TACCCCCGTG 3720 

TCCATAAAAC AATACCCCAT GTCACAAGAA GCCAGACTGG GGATCAAGCC CCACATACAG 3780 

AGACTGTTGG ACCAGGGAAT ACTGGTACCC TGCCAGTCCC CCTGGAACAC GCCCCTGCTA 3840 

CCCGTTAAGA AACCAGGGAC TAATGATTAT AGGCCTGTCC AGGATCTGAG AGAAGTCAAC 3900 

AAGCGGGTGG AAGACATCCA CCCCACCGTG CCCAACCCTT ACAACCTCTT GAGCGGGCTC 3960

CCACCGTCCC ACCAGTGGTA CACTGTGCTT GATTTAAAGG ATGCCTTTTT CTGCCTGAGA 4020 

CTCCACCCCA CCAGTCAGCC TCTCTTCGCC TTTGAGTGGA GAGATCCAGA GATGGGAATC 4080 
TCAGGACAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC CACCCTGTTT 4140

GATGAGGCAC TGCACAGAGA CCTAGCAGAC TTCCGGATCC AGCACCCAGA CTTGATCCTG 4200 

CTACAGTACG TGGATGACTT ACTGCTGGCC GCCACTTCTG AGCTAGACTG CCAACAAGGT 4260 

ACTCGGGCCC TGTTACAAAC CCTAGGGAAC CTCGGGTATC GGGCCTCGGC CAAGAAAGCC 4320 

CAAATTTGCC AGAAACAGGT CAAGTATCTG GGGTATCTTC TAAAAGAGGG TCAGAGATGG 4380 

CTGACTGAGG CCAGAAAAGA GACTGTGATG GGGCAGCCTA CTCCGAAGAC CCCTCGACAA 4440 

CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTTTGCAGAA 4500 

ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGGACTC TGTTTAATTG GGGCCCAGAC 4560 

CAACAAAAGG CCTATCAAGA AATCAAGCAA GCTCTTCTAA CTGCCCCAGC CCTGGGGTTG 4620 

CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCTA CGCCAAAGGT 4680 

GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG CCTACCTGTC CAAAAAGCTA 4740 

GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCAT TGCCGTACTG 4800 

ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATGCA 4860 

GTAGAGGCAC TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATGACTCAC 4920 

TATCAGGCCT TGCTTTTGGA CACGGACCGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAC 4980 

CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 5040 

GCCGAAGCCC ACGGAACCCG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCGACCAC 5100 

ACCTGGTACA CGGATGGAAG CAGTCTCTTA CAAGAGGGAC AGCGTAAGGC GGGAGCTGCG 5160 

GTGACCACCG AGACCGAGGT AATCTGGGCT AAAGCCCTGC CAGCCGGGAC ATCCGCTCAG 5220 

CGGGCTGAAC TGATAGCACT CACCCAGGCC CTAAAGATGG CAGAAGGTAA GAAGCTAAAT 5280 

GTTTATACTG ATAGCCGTTA TGCTTTTGCT ACTGCCCATA TCCATGGAGA AATATACAGA 5340 

AGGCGTGGGT TGCTCACATC AGAAGGCAAA GAGATCAAAA ATAAAGACGA GATCTTGGCC 5400 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 5460 

AAGGGACACA GCGCCGAGGC TAGAGGCAAC CGGATGGCTG ACCAAGCGGC CCGAAAGGCA 5520 

GCCATCACAG AGACTCCAGA CACCTCTACC CTCCTCATAG AAAATTCATC ACCCTACACC 5580 

TCAGAACATT TTCATTACAC AGTGACTGAT ATAAAGGACC TAACCAAGTT GGGGGCCATT 5640 

TATGATAAAA CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTGTGATGCC TGACCAGTTT 5700 

ACTTTTGAAT TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 5760 

GCTCTCCTAG AGAGAAGCCA CAGTCCCTAC TACATGCTGA ACCGGGATCG AACACTCAAA 5820

AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGCCGTTAAA 5880 

CAGGGAACTA GGGTCCGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 5940 

ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TTATAGATAC CTTTTCTGGC 6000 

TGGATAGAAG CCTTCCCAAC CAAGAAAGAA ACCGCCAAGG TCGTAACCAA GAAGCTACTA 6060 

GAGGAGATCT TCCCCAGGTT CGGCATGCCT CAGGTATTGG GAACTGACAA TGGGCCTGCC 6120 

TTCGTCTCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG GAAATTACAT 6180 

TGTGCATACA GACCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG 6240 

ACTTTAACTA AATTAACGCT TGCAACTGGC TCTAGAGACT GGGTGCTCCT ACTCCCCTTA 6300 

GCCCTGTACC GAGCCCGCAA CACGCCGGGC CCCCATGGCC TCACCCCATA TGAGATCTTA 6360 

TATGGGGCAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTACTAAC 6420 

AGCCCCTCTC TCCAAGCTCA CTTACAGGCT CTCTACTTAG TCCAGCACGA AGTCTGGAGA 6480 

CCTCTGGCGG CAGCCTACCA AGAACAACTG GACCGACCGG TGGTACCTCA CCCTTACCGA 6540 

GTCGGCGACA CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCGCTGGAAA 6600 

GGACCTTACA CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CATCGCAGCT 6660 

TGGATACACG CCGCCCACGT GAAGGCTGCC GACCCCGGGG GTGGACCATC CTCTAGACTG 6720 

ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 6780 

TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 6840 

CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 6900 

AAGTAGCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA TAAACATCAT GTGGGAGCGG 6960 

CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG TATATAGGAC 7020 

GAGTAACTGT TTGTGCAGAA GCCATTGCGA TTGGTAGTGC AGTTTCGAAT GGACAAAAGG 7080 

ATTTTGACAC GATTGTAGCT GTTAGACACC CTTATTCTGA CGAAGTAGAT AGAAGTATTC 7140 

GAGTGGTAAG TCCTTGTGGT ATGTGTAGGG AGTTGATTTC AGACTATGCA CCAGATTGTT 7200 

TTGTGTTAAT AGAAATGAAT GGCAAGTTAG TCAAAACTAC GATTGAAGAA CTCATTCCAC 7260

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 7308 
AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 60
AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT 120
TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 180
CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 240
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 300
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 360
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 420
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 480
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 540
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 600
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 660
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 720
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 780
CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 840
GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 900
CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 960
TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 1020
CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 1080
TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 1140
AGATTGTAAG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 1200
TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAA TGATAAGGTA GAATATTTCT 1260
GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 1320
CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 1380
ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 1440
ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGAATTGGCC 1500
GCAAGCTTCT GCAGCATCGT TCTGTGTTGT CTCTGTCTGA CTGTGTTTCT GTATTTGTCT 1560
GAGAATATGG GCCAGACTGT TACCACTCCC TTAAGTTTGA CCTTAGGTCA CTGGAAAGAT 1620
GTCGAGCGGA TCGCTCACAA CCAGTCGGTA GATGTCAAGA AGAGACGTTG GGTTACCTTC 1680
TGCTCTGCAG AATGGCCAAC CTTTAACGTC GGATGGCCGC GAGACGGCAC CTTTAACCGA 1740
GACCTCATCA CCCAGGTTAA GATCAAGGTC TTTTCACCTG GCCCGCATGG ACACCCAGAC 1800
CAGGTCCCCT ACATCGTGAC CTGGGAAGCC TTGGCTTTTG ACCCCCCTCC CTGGGTCAAG 1860
CCCTTTGTAC ACCCTAAGCC TCCGCCTCCT CTTCCTCCAT CCGCCCCGTC TCTCCCCCTT 1920
GAACCTCCTC GTTCGACCCC GCCTCGATCC TCCCTTTATC CAGCCCTCAC TCCTTCTCTA 1980
GGCGCCAAAC CTAAACCTCA AGTTCTTTCT GACAGTGGGG GGCCGCTCAT CGACCTACTT 2040
ACAGAAGACC CCCCGCCTTA TAGGGACCCA AGACCACCCC CTTCCGACAG GGACGGAAAT 2100
GGTGGAGAAG CGACCCCTGC GGGAGAGGCA CCGGACCCCT CCCCAATGGC ATCTCGCCTA 2160
CGTGGGAGAC GGGAGCCCCC TGTGGCCGAC TCCACTACCT CGCAGGCATT CCCCCTCCGC 2220
GCAGGAGGAA ACGGACAGCT TCAATACTGG CCGTTCTCCT CTTCTGACCT TTACAACTGG 2280
AAAAATAATA ACCCTTCTTT TTCTGAAGAT CCAGGTAAAC TGACAGCTCT GATCGAGTCT 2340
GTTCTCATCA CCCATCAGCC GACCTGGGAC GACTGTCAGC AGCTGTTGGG GACTCTGCTG 2400
ACCGGAGAAG AAAAACAACG GGTGCTCTTA GAGGCTAGAA AGGCGGTGCG GGGCGATGAT 2460
GGGCGCCCCA CTCAACTGCC CAATGAAGTC GATGCCGCTT TTCCCCTCGA GCGCCCAGAC 2520
TGGGATTACA CCACCCAGGC AGGACGCAAC CACCTAGTCC ACTATCGCCA GTTGCTCCTA 2580
GCGGGTCTCC AAAACGCGGG CAGAAGCCCC ACCAATTTGG CCAAGGTAAA AGGAATAACA 2640
CAAGGGCCCA ATGAGTCTCC CTCGGCCTTC CTAGAGAGAC TTAAGGAAGC CTATCGCAGG 2700
TACACTCCTT ATGACCCTGA GGACCCAGGG CAAGAAACTA ATGTGTCTAT GTCTTTCATT 2760
TGGCAGTCTG CCCCAGACAT TGGGAGAAAG TTAGAGAGGT TAGAAGATTT AAAAAACAAG 2820
ACGCTTGGAG ATTTGGTTAG AGAGGCAGAA AAGATCTTTA ATAAACGAGA AACCCCGGAA 2880
GAAAGAGAGG AACGTATCAG GAGAGAAACA GAGGAAAAAG AAGAACGCCG TAGGACAGAG 2940
GATGAGCAGA AAGAGAAAGA AAGAGATCGT AGGAGACATA GAGAGATGAG CAAGCTATTG 3000
GCCACTGTCG TTAGTGGACA GAAACAGGAT AGACAGGGAG GAGAACGAAG GAGGTCCCAA 3060
CTCGATCGCG ACCAGTGTGC CTACTGCAAA GAAAAGGGGC ACTGGGCTAA AGATTGTCCC 3120
AAGAAACCAC GAGGACCTCG GGGACCAAGA CCCCAGACCT CCCTCCTGAC CCTAGATGAC 3180
TAGGGAGGTC AGGGTCAGGA GCCCCCCCCT GAACCCAGGA TAACCCTCAA AGTCGGGGGG 3240
CAACCCGTCA CCTTCCTGGT AGATACTGGG GCCCAACACT CCGTGCTGAC CCAAAATCCT 3300
GGACCCCTAA GTGATAAGTC TGCCTGGGTC CAAGGGGCTA CTGGAGGAAA GCGGTATCGC 3360
TGGACCACGG ATCGCAAAGT ACATCTAGCT ACCGGTAAGG TCACCCACTC TTTCCTCCAT 3420
GTACCAGACT GTCCCTATCC TCTGTTAGGA AGAGATTTGC TGACTAAACT AAAAGCCCAA 3480
ATCCACTTTG AGGGATCAGG AGCTCAGGTT ATGGGACCAA TGGGGCAGCC CCTGCAAGTG 3540
TTGACCCTAA ATATAGAAGA TGAGCATCGG CTACATGAGA CCTCAAAAGA GCCAGATGTT 3600
TCTCTAGGGT CCACATGGCT GTCTGATTTT CCTCAGGCCT GGGCGGAAAC CGGGGGCATG 3660
GGACTGGCAG TTCGCCAAGC TCCTCTGATC ATACCTCTGA AAGCAACCTC TACCCCCGTG 3720
TCCATAAAAC AATACCCCAT GTCACAAGAA GCCAGACTGG GGATCAAGCC CCACATACAG 3780
AGACTGTTGG ACCAGGGAAT ACTGGTACCC TGCCAGTCCC CCTGGAACAC GCCCCTGCTA 3840
CCCGTTAAGA AACCAGGGAC TAATGATTAT AGGCCTGTCC AGGATCTGAG AGAAGTCAAC 3900
AAGCGGGTGG AAGACATCCA CCCCACCGTG CCCAACCCTT ACAACCTCTT GAGCGGGCTC 3960
CCACCGTCCC ACCAGTGGTA CACTGTGCTT GATTTAAAGG ATGCCTTTTT CTGCCTGAGA 4020
CTCCACCCCA CCAGTCAGCC TCTCTTCGCC TTTGAGTGGA GAGATCCAGA GATGGGAATC 4080
TCAGGACAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC CACCCTGTTT 4140
GATGAGGCAC TGCACAGAGA CCTAGCAGAC TTCCGGATCC AGCACCCAGA CTTGATCCTG 4200

CTACAGTACG TGGATGACTT ACTGCTGGCC GCCACTTCTG AGCTAGACTG CCAACAAGGT 4260 

ACTCGGGCCC TGTTACAAAC CCTAGGGAAC CTCGGGTATC GGGCCTCGGC CAAGAAAGCC 4320 

CAAATTTGCC AGAAACAGGT CAAGTATCTG GGGTATCTTC TAAAAGAGGG TCAGAGATGG 4380

CTGACTGAGG CCAGAAAAGA GACTGTGATG GGGCAGCCTA CTCCGAAGAC CCCTCGACAA 4440 

CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTTTGCAGAA 4500 

ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGGACTC TGTTTAATTG GGGCCCAGAC 4560 

CAACAAAAGG CCTATCAAGA AATCAAGCAA GCTCTTCTAA CTGCCCCAGC CCTGGGGTTG 4620 

CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCTA CGCCAAAGGT 4680 

GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG CCTACCTGTC CAAAAAGCTA 4740 

GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCAT TGCCGTACTG 4800 

ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATGCA 4860 

GTAGAGGCAC TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATGACTCAC 4920 

TATCAGGCCT TGCTTTTGGA CACGGACCGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAC 4980 

CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 5040 

GCCGAAGCCC ACGGAACCCG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCGACCAC 5100 

ACCTGGTACA CGGATGGAAG CAGTCTCTTA CAAGAGGGAC AGCGTAAGGC GGGAGCTGCG 5160 

GTGACCACCG AGACCGAGGT AATCTGGGCT AAAGCCCTGC CAGCCGGGAC ATCCGCTCAG 5220 

CGGGCTGAAC TGATAGCACT CACCCAGGCC CTAAAGATGG CAGAAGGTAA GAAGCTAAAT 5280 

GTTTATACTG ATAGCCGTTA TGCTTTTGCT ACTGCCCATA TCCATGGAGA AATATACAGA 5340

AGGCGTGGGT TGCTCACATC AGAAGGCAAA GAGATCAAAA ATAAAGACGA GATCTTGGCC 5400 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 5460 

AAGGGACACA GCGCCGAGGC TAGAGGCAAC CGGATGGCTG ACCAAGCGGC CCGAAAGGCA 5520 

GCCATCACAG AGACTCCAGA CACCTCTACC CTCCTCATAG AAAATTCATC ACCCTACACC 5580

TCAGAACATT TTCATTACAC AGTGACTGAT ATAAAGGACC TAACCAAGTT GGGGGCCATT 5640 

TATGATAAAA CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTGTGATGCC TGACCAGTTT 5700 

ACTTTTGAAT TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 5760 

GCTCTCCTAG AGAGAAGCCA CAGTCCCTAC TACATGCTGA ACCGGGATCG AACACTCAAA 5820 

AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGCCGTTAAA 5880 

CAGGGAACTA GGGTCCGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 5940 

ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TTATAGATAC CTTTTCTGGC 6000 

TGGATAGAAG CCTTCCCAAC CAAGAAAGAA ACCGCCAAGG TCGTAACCAA GAAGCTACTA 6060 

GAGGAGATCT TCCCCAGGTT CGGCATGCCT CAGGTATTGG GAACTGACAA TGGGCCTGCC 6120 

TTCGTCTCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG GAAATTACAT 6180 

TGTGCATACA GACCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG 6240 

ACTTTAACTA AATTAACGCT TGCAACTGGC TCTAGAGACT GGGTGCTCCT ACTCCCCTTA 6300 

GCCCTGTACC GAGCCCGCAA CACGCCGGGC CCCCATGGCC TCACCCCATA TGAGATCTTA 6360 

TATGGGGCAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTACTAAC 6420 

AGCCCCTCTC TCCAAGCTCA CTTACAGGCT CTCTACTTAG TCCAGCACGA AGTCTGGAGA 6480

CCTCTGGCGG CAGCCTACCA AGAACAACTG GACCGACCGG TGGTACCTCA CCCTTACCGA 6540 

GTCGGCGACA CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCGCTGGAAA 6600 

GGACCTTACA CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CATCGCAGCT 6660

TGGATACACG CCGCCCACGT GAAGGCTGCC GACCCCGGGG GTGGACCATC CTCTAGACTG 6720 

ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 6780 

TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 6840

CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 6900 

AAGTAGCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA TAAACATCAT GTGGGAGCGG 6960

CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG TATATAGGAC 7020

GAGTAACTGT TTGTGCAGAA GCCATTGCGA TTGGTAGTGC AGTTTCGAAT GGACAAAAGG 7080 

ATTTTGACAC GATTGTAGCT GTTAGACACC CTTATTCTGA CGAAGTAGAT AGAAGTATTC 7140 

GAGTGGTAAG TCCTTGTGGT ATGTGTAGGG AGTTGATTTC AGACTATGCA CCAGATTGTT 7200 

TTGTGTTAAT AGAAATGAAT GGCAAGTTAG TCAAAACTAC GATTGAAGAA CTCATTCCAC 7260 

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 7308 
CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 60

TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA 120

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 180 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 240 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 360 

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 420 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 540 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 720 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 960 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATGG GGCACCCCCG CCCCTTGTAA 1020 

ACTTCCCTGA CCCTGACATG ACCAGAGTTA CTAACAGCCC CTCTCTCCAA GCTCACTTAC 1080 

AGGCTCTCTA CTTAGTCCAG CACGAAGTTT GGAGACCACT GGCGGCAGCT TACCAAGAAC 1140 

AACTGGACCG GCCGGTGGTG CCTCACCCTT ACCGGGTCGG CGACACAGTG TGGGTCCGCC 1200 

GACATCAAAC CAAGAACCTA GAACCTCGCT GGAAAGGACC TTACACAGTC CTGCTGACCA 1260 

CCCCCACCGC CCTCAAAGTA GACGGTATCG CAGCTTGGAT ACACGCAGCC CACGTAAAGG 1320 

CGGCCGACAC CGAGAGTGGA CCATCCTCTG GACGGACATG GCGCGTTCAA CGCTCTCAAA 1380 

ACCCCCTCAA GATAAGATTA ACCCGTGGAA GCCCTTAATA GTCATGGGAG TCCTGTTAGG 1440 

AGTAGGGATG GCAGAGAGCC CCCATCAGGT CTTTAATGTA ACCTGGAGAG TCACCAACCT 1500 

GATGACTGGG CGTACCGCCA ATGCGACCTC CCTCCTGGGA ACTGTACAAG ATGCCTTCCC 1560 

AAAATTATAT TTTGATCTAT GTGATCTGGT CGGAGAGGAG TGGGACCCTT CAGACCAGGA 1620 

ACCGTATGTC GGGTATGGCT GCAAGTACCC CGCAGGGAGA CAGCGGACCC GGACTTTTGA 1680

CTTTTACGTG TGCCCTGGGC ATACCGTAAA GTCGGGGTGT GGGGGACCAG GAGAGGGCTA 1740 

CTGTGGTAAA TGGGGGTGTG AAACCACCGG ACAGGCTTAC TGGAAGCCCA CATCATCGTG 1800 

GGACCTAATC TCCCTTAAGC GCGGTAACAC CCCCTGGGAC ACGGGATGCT CTAAAGTTGC 1860 

CTGTGGCCCC TGCTACGACC TCTCCAAAGT ATCCAATTCC TTCCAAGGGG CTACTCGAGG 1920 

GGGCAGATGC AACCCTCTAG TCCTAGAATT CACTGATGCA GGAAAAAAGG CTAACTGGGA 1980 

CGGGCCCAAA TCGTGGGGAC TGAGACTGTA CCGGACAGGA ACAGATCCTA TTACCATGTT 2040 

CTCCCTGACC CGGCAGGTCC TTAATGTGGG ACCCCGAGTC CCCATAGGGC CCAACCCAGT 2100 

ATTACCCGAC CAAAGACTCC CTTCCTCACC AATAGAGATT GTACCGGCTC CACAGCCACC 2160 

TAGCCCCCTC AATACCAGTT ACCCCCCTTC CACTACCAGT ACACCCTCAA CCTCCCCTAC 2220 

AAGTCCAAGT GTCCCACAGC CACCCCCAGG AACTGGAGAT AGACTACTAG CTCTAGTCAA 2280 

AGGAGCCTAT CAGGCGCTTA ACCTCACCAA TCCCGACAAG ACCCAAGAAT GTTGGCTGTG 2340 

CTTAGTGTCG GGACCTCCTT ATTACGAAGG AGTAGCGGTC GTGGGCACTT ATACCAATCA 2400 

TTCCACCGCT CCGGCCAACT GTACGGCCAC TTCCCAACAT AAGCTTACCC TATCTGAAGT 2460 

GACAGGACAG GGCCTATGCA TGGGGGCAGT ACCTAAAACT CACCAGGCCT TATGTAACAC 2520 

CACCCAAAGC GCCGGCTCAG GATCCTACTA CCTTGCAGCA CCCGCCGGAA CAATGTGGGC 2580 

TTGCAGCACT GGATTGACTC CCTGCTTGTC CACCACGGTG CTCAATCTAA CCACAGATTA 2640 

TTGTGTATTA GTTGAACTCT GGCCCAGAGT AATTTACCAC TCCCCCGATT ATATGTATGG 2700 

TCAGCTTGAA CAGCGTACCA AATATAAAAG AGAGCCAGTA TCATTGACCC TGGCCCTTCT 2760 

ACTAGGAGGA TTAACCATGG GAGGGATTGC AGCTGGAATA GGGACGGGGA CCACTGCCTT 2820 

AATTAAAACC CAGCAGTTTG AGCAGCTTCA TGCCGCTATC CAGACAGACC TCAACGAAGT 2880 

CGAAAAGTCA ATTACCAACC TAGAAAAGTC ACTGACCTCG TTGTCTGAAG TAGTCCTACA 2940

GAACCGCAGA GGCCTAGATT TGCTATTCCT AAAGGAGGGA GGTCTCTGCG CAGCCCTAAA 3000 

AGAAGAATGT TGTTTTTATG CAGACCACAC GGGGCTAGTG AGAGACAGCA TGGCCAAATT 3060 

AAGAGAAAGG CTTAATCAGA GACAAAAACT ATTTGAGACA GGCCAAGGAT GGTTCGAAGG 3120 

GCTGTTTAAT AGATCCCCCT GGTTTACCAC CTTAATCTCC ACCATCATGG GACCTCTAAT 3180 

AGTACTCTTA CTGATCTTAC TCTTTGGACC TTGCATTCTC AATCGATTAG TTCAATTTGT 3240 

TAAAGACAGG ATCTCAGTAG TCCAGGCTTT AGTCCTGACT CAACAATACC ACCAGCTAAA 3300 

GCCTATAGAG TACGAGCCAT AGGGCGCCTA GTGTTGACAA TTAATCATCG GCATAGTATA 3360 

CGGCATAGTA TAATACGACT CACTATAGGA GGGCCACCAT GGCCAAGTTG ACCAGTGCCG 3420 

TTCCGGTGCT CACCGCGCGC GACGTCGCCG GAGCGGTCGA GTTCTGGACC GACCGGCTCG 3480

GGTTCTCCCG GGACTTCGTG GAGGACGACT TCGCCGGTGT GGTCCGGGAC GACGTGACCC 3540 

TGTTCATCAG CGCGGTCCAG GACCAGGTGG TGCCGGACAA CACCCTGGCC TGGGTGTGGG 3600

TGCGCGGCCT GGACGAGCTG TACGCCGAGT GGTCGGAGGT CGTGTCCACG AACTTCCGGG 3660 

ACGCCTCCGG GCCGGCCATG ACCGAGATCG GCGAGCAGCC GTGGGGGCGG GAGTTCGCCC 3720 

TGCGCGACCC GGCCGGCAAC TGCGTGCACT TCGTGGCCGA GGAGCAGGAC TGANNNNCGG 3780 

ACCGGTCGAC TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC 3840 

ATCACAAATT TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA 3900 

CTCATCAATG TATCTTATCA TGTCTGGATC CAGATCTGGG CCCATGCGGC CGCGGATCGA 3960 

TNNNNACATG TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT 4020 

GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 4080 
Figure 9. FBdelPASAF Sequence 2

GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT 4140 
CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 4200

GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT 4260 

TCGCTCCAAG CTGGGCTGTG TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC 4320 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT AAGACACGAC TTATCGCCAC TGGCAGCAGC 4380 

CACTGGTAAC AGGATTAGCA GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG 4440 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC AGTATTTGGT ATCTGCGCTC TGCTGAAGCC 4500 

AGTTACCTTC GGAAAAAGAG TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG 4560 

CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT TACGCGCAGA AAAAAAGGAT CTCAAGAAGA 4620 

TCCTTTGATC TTTTCTACGG GGTCTGACGC TCAGTGGAAC GAAAACTCAC GTTAAGGGAT 4680

TTTGGTCATG AGATTATCAA AAAGGATCTT CACCTAGATC CTTTTAAATT AAAAATGAAG 4740 

TTTTAAATCA ATCTAAAGTA TATATGAGTA AACTTGGTCT GACAGTTACC AATGCTTAAT 4800 

CAGTGAGGCA CCTATCTCAG CGATCTGTCT ATTTCGTTCA TCCATAGTTG CCTGACTCCC 4860 

CGTCGTGTAG ATAACTACGA TACGGGAGGG CTTACCATCT GGCCCCAGTG CTGCAATGAT 4920 

ACCGCGAGAC CCACGCTCAC CGGCTCCAGA TTTATCAGCA ATAAACCAGC CAGCCGGAAG 4980 

GGCCGAGCGC AGAAGTGGTC CTGCAACTTT ATCCGCCTCC ATCCAGTCTA TTAATTGTTG 5040 

CCGGGAAGCT AGAGTAAGTA GTTCGCCAGT TAATAGTTTG CGCAACGTTG TTGCCATTGC 5100 

TACAGGCATC GTGGTGTCAC GCTCGTCGTT TGGTATGGCT TCATTCAGCT CCGGTTCCCA 5160 

ACGATCAAGG CGAGTTACAT GATCCCCCAT GTTGTGCAAA AAAGCGGTTA GCTCCTTCGG 5220 

TCCTCCGATC GTTGTCAGAA GTAAGTTGGC CGCAGTGTTA TCACTCATGG TTATGGCAGC 5280 

ACTGCATAAT TCTCTTACTG TCATGCCATC CGTAAGATGC TTTTCTGTGA CTGGTGAGTA 5340 

CTCAACCAAG TCATTCTGAG AATAGTGTAT GCGGCGACCG AGTTGCTCTT GCCCGGCGTC 5400 

AATACGGGAT AATACCGCGC CACATAGCAG AACTTTAAAA GTGCTCATCA TTGGAAAACG 5460 

TTCTTCGGGG CGAAAACTCT CAAGGATCTT ACCGCTGTTG AGATCCAGTT CGATGTAACC 5520 

CACTCGTGCA CCCAACTGAT CTTCAGCATC TTTTACTTTC ACCAGCGTTT CTGGGTGAGC 5580 

AAAAACAGGA AGGCAAAATG CCGCAAAAAA GGGAATAAGG GCGACACGGA AATGTTGAAT 5640 

ACTCATACTC TTCCTTTTTC AATATTATTG AAGCATTTAT CAGGGTTATT GTCTCATGAG 5700 

CGGATACATA TTTGAATGTA TTTAGAAAAA TAAACAAATA GGGGTTCCGC GCACATTTCC 5760 

CCGAAAAGTG CCACCTGACG TCTAAGAAAC CATTATTATC ATGACATTAA CCTATAAAAA 5820

TAGGCGTATC ACGAGGCCCT TTCGTCTCGC GCGTTTCGGT GATGACGGTG AAAACCTCTG 5880 

ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA GCGGATGCCG GGAGCAGACA 5940 

AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG CGGGTGTCGG GGCTGGCTTA ACTATGCGGC 6000 

ATCAGAGCAG ATTGTACTGA GAGTGCAC 6028 
CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 60
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA 120

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 180 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 240 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 360

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 420 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 540 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAAATGACC CTGTGCCTTA TTTGAATTAA GCAATCAGCC TGCTTCTCGC TTCTGTTCGC 720 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 960 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATGG GGCACCCCCG CCCCTTGTAA 1020 

ACTTCCCTGA CCCTGACATG ACAAGAGTTA CTAACAGCCC CTCTCTCCAA GCTCACTTAC 1080 

AGGCTCTCTA CTTAGTCCAG CACGAAGTCT GGAGACCTCT GGCGGCAGCC TACCAAGAAC 1140 

AACTGGACCG ACCGGTGGTA CCTCACCCTT ACCGAGTCGG CGACACAGTG TGGGTCCGCC 1200 

GACACCAGAC TAAGAACCTA GAACCTCGGT GGAAAGGACC TTACACAGTC CTGCTGACCA 1260 

CCCCCACCGC CCTCAAAGTA GACGGCATCG CAGCTTGGAT ACACGCCGCC CACGTGAAGG 1320 

CTGCCGACCC CGGGGGTGGA CCATCCTCTA GACTGACATG GCGCGTTCAA CGCTCTCAAA 1380

ACCCCTTAAA AATAAGGTTA ACCCGCGAGG CCCCCTAATC CCCTTAATTC TTCTGATGCT 1440 
CAGAGGGGTC AGTACTGCTT CGCCCGGCTC CAGTCCTCAT CAAGTCTATA ATATCACCTG 1500

GGAGGTAACC AATGGAGATC GGGAGACGGT ATGGGCAACT TCTGGCAACC ACCCTCTGTG 1560

GACCTGGTGG CCTGACCTTA CCCCAGATTT ATGTATGTTA GCCCACCATG GACCATCTTA 1620 

TTGGGGGCTA GAATATCAAT CCCCTTTTTC TTCTCCCCCG GGGCCCCCTT GTTGCTCAGG 1680 

GGGCAGCAGC CCAGGCTGTT CCAGAGACTG CGAAGAACCT TTAACCTCCC TCACCCCTCG 1740 

GTGCAACACT GCCTGGAACA GACTCAAGCT AGACCAGACA ACTCATAAAT CAAATGAGGG 1800 

ATTTTATGTT TGCCCCGGGC CCCACCGCCC CCGAGAATCC AAGTCATGTG GGGGTCCAGA 1860 

CTCCTTCTAC TGTGCCTATT GGGGCTGTGA GACAACCGGT AGAGCTTACT GGAAGCCCTC 1920 

CTCATCATGG GATTTCATCA CAGTAAACAA CAATCTCACC TCTGACCAGG CTGTCCAGGT 1980 

ATGCAAAGAT AATAAGTGGT GCAACCCCTT AGTTATTCGG TTTACAGACG CCGGGAGACG 2040 

GGTTACTTCC TGGACCACAG GACATTACTG GGGCTTACGT TTGTATGTCT CCGGACAAGA 2100 

TCCAGGGCTT ACATTTGGGA TCCGACTCAG ATACCAAAAT CTAGGACCCC GCGTCCCAAT 2160

AGGGCCAAAC CCCGTTCTGG CAGACCAACA GCCACTCTCC AAGCCCAAAC CTGTTAAGTC 2220

GCCTTCAGTC ACCAAACCAC CCAGTGGGAC TCCTCTCTCC CCTACCCAAC TTCCACCGGC 2280 

GGGAACGGAA AATAGGCTGC TAAACTTAGT AGACGGAGCC TACCAAGCCC TCAACCTCAC 2340 

CAGTCCTGAC AAAACCCAAG AGTGCTGGTT GTGTCTAGTA GCGGGACCCC CCTACTACGA 2400 

AGGGGTTGCC GTCCTGGGTA CCTACTCCAA CCATACCTCT GCTCCAGCCA ACTGCTCCGT 2460 

GGCCTCCCAA CACAAGTTGA CCCTGTCCGA AGTGACCGGA CAGGGACTCT GCATAGGAGC 2520 

AGTTCCCAAA ACACATCAGG CCCTATGTAA TACCACCCAG ACAAGCAGTC GAGGGTCCTA 2580 

TTATCTAGTT GCCCCTACAG GTACCATGTG GGCTTGTAGT ACCGGGCTTA CTCCATGCAT 2640

CTCCACCACC ATACTGAACC TTACCACTGA TTATTGTGTT CTTGTCGAAC TCTGGCCAAG 2700 

AGTCACCTAT CATTCCCCCA GCTATGTTTA CGGCCTGTTT GAGAGATCCA ACCGACACAA 2760 

AAGAGAACCG GTGTCGTTAA CCCTGGCCCT ATTATTGGGT GGACTAACCA TGGGGGGAAT 2820 

TGCCGCTGGA ATAGGAACAG GGACTACTGC TCTAATGGCC ACTCAGCAAT TCCAGCAGCT 2880 

CCAAGCCGCA GTACAGGATG ATCTCAGGGA GGTTGAAAAA TCAATCTCTA ACCTAGAAAA 2940 

GTCTCTCACT TCCCTGTCTG AAGTTGTCCT ACAGAATCGA AGGGGCCTAG ACTTGTTATT 3000 

TCTAAAAGAA GGAGGGCTGT GTGCTGCTCT AAAAGAAGAA TGTTGCTTCT ATGCGGACCA 3060 

CACAGGACTA GTGAGAGACA GCATGGCCAA ATTGAGAGAG AGGCTTAATC AGAGACAGAA 3120 

ACTGTTTGAG TCAACTCAAG GATGGTTTGA GGGACTGTTT AACAGATCCC CTTGGTTTAC 3180 

CACCTTGATA TCTACCATTA TGGGACCCCT CATTGTACTC CTAATGATTT TGCTCTTCGG 3240 

ACCCTGCATT CTTAATCGAT TAGTTCAATT TGTTAAAGAC AGGATCTCAG TAGTCCAGGC 3300 

TTTAGTCCTG ACTCAACAAT ACCACCAGCT AAAGCCTATA GAGTACGAGC CATAGGGCGC 3360 

CTAGTGTTGA CAATTAATCA TCGGCATAGT ATACGGCATA GTATAATACG ACTCACTATA 3420 

GGAGGGCCAC CATGGCCAAG TTGACCAGTG CCGTTCCGGT GCTCACCGCG CGCGACGTCG 3480 

CCGGAGCGGT CGAGTTCTGG ACCGACCGGC TCGGGTTCTC CCGGGACTTC GTGGAGGACG 3540 

ACTTCGCCGG TGTGGTCCGG GACGACGTGA CCCTGTTCAT CAGCGCGGTC CAGGACCAGG 3600 

TGGTGCCGGA CAACACCCTG GCCTGGGTGT GGGTGCGCGG CCTGGACGAG CTGTACGCCG 3660 

AGTGGTCGGA GGTCGTGTCC ACGAACTTCC GGGACGCCTC CGGGCCGGCC ATGACCGAGA 3720 

TCGGCGAGCA GCCGTGGGGG CGGGAGTTCG CCCTGCGCGA CCCGGCCGGC AACTGCGTGC 3780 

ACTTCGTGGC CGAGGAGCAG GACTGANNNN CGGACCGGTC GACTTGTTAA CTTGTTTATT 3840 

GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA TAAAGCATTT 3900 

TTTTCACTGC ATTCTAGTTG TGGTTTGTCC AAACTCATCA ATGTATCTTA TCATGTCTGG 3960 

ATCCAGATCT GGGCCCATGC GGCCGCGGAT CGATNNNNAC ATGTGAGCAA AAGGCCAGCA 4020 

AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 4080 
TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA 4140 

AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 4200 

GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 4260 

ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 4320 

ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC 4380 

GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG 4440 

GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG 4500 

GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 4560 

CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 4620 

GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 4680 

CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT 4740 

CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA 4800 

GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG 4860 

TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA 4920 

GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC 4980 

AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 5040 

TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 5100 

AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC 5160 

GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC 5220 

CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 5280 

GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC 5340 

ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG 5400 

TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG 5460 

CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 5520 

CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC 5580 

ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 5640 

AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA 5700 

TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 5760 

AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG ACGTCTAAGA 5820 

AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTCT 5880 

CGCGCGTTTC GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC 5940 

AGCTTGTCTG TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT 6000 

TGGCGGGTGT CGGGGCTGGC TTAACTATGC GGCATCAGAG CAGATTGTAC TGAGAGTGCA 6060 

C 6061 
CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 60 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA 120 

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAGGGTTT 180 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 240 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACCA 300 

CTAGAATACG AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 360 

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 420 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 480 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 540 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 600 

GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACC 660 

TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCTCGC TTCTGTTCGC 720 

GCGCTTCTGC TTCCCGAGCT CTATAAAAGA GCTCACAACC CCTCACTCGG CGCGCCAGTC 780 

CTCCGATAGA CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT GCTGTTGCAT 840 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 900 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 960 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCCCTAAGGT ACTCGGGTCA GACAATGGCC 1020 

CGGCCTTTGT TGCTCAGGTA AGTCAGGGAC TGGCCACTCA ACTGGGGATA AATTGGAAGT 1080 

TACATTGTGC GTATAGACCC CAGAGCTCAG GTCAGGTAGA AAGAATGAAC AGAACAATTA 1140 

AAGAGACCTT GACCAAATTA GCCTTAGAGA CCGGTGGAAA AGACTGGGTG ACCCTCCTTC 1200 

CCTTAGCGCT GCTTAGGGCC AGGAATACGC CTGGCCGGTT TGGTTTAACT CCTTATGAAA 1260 

TTCTCTATGG AGGACCACCC CCCATACTTG AGTCTGGAGA AACTTTGGGT CCCGATGATA 1320 

GATTTCTCCC TGTCTTATTT ACTCACTTAA AGGCTTTAGA AATTGTAAGG ACCCAAATCT 1380 

GGGACCAGAT CAAAGAGGTG TATAAGCCTG GTACCGTAAC AATCCCTCAC CCGTTCCAGG 1440 

TCGGGGATCA AGTGCTTGTC AGACGCCATC GACCCAGCAG CCTTGAGCCT CGGTGGAAAG 1500 

GCCCATACCT GGTGTTGCTG ACTACCCCGA CCGCGGTAAA AGTCGATGGT ATTGCTGCCT 1560 

GGGTCCATGC TTCTCACCTC AAACCTGCAC CACCTTCGGC ACCAGATGAG TCCTGGGAGC 1620 

TGGAAAAGAC TGATCATCCT CTTAAGCTGC GTATTCGGCG GCGGCGGGAC GAGTCTGCAA 1680 

AATAAGAACC CCCACCAGCC CATGACCCTC ACTTGGCAGG TACTGTCCCA AACTGGAGAC 1740 

GTTGTCTGGG ATACAAAGGC AGTCCAGCCC CCTTGGACTT GGTGGCCCAC ACTTAAACCT 1800 

GATGTATGTG CCTTGGCGGC TAGTCTTGAG TCCTGGGATA TCCCGGGAAC CGATGTCTCG 1860 

TCCTCTAAAC GAGTCAGACC TCCGGACTCA GACTATACTG CCGCTTATAA GCAAATCACC 1920 

TGGGGAGCCA TAGGGTGCAG CTACCCTCGG GCTAGGACTA GAATGGCAAG CTCTACCTTC 1980 

TACGTATGTC CCCGGGATGG CCGGACCCTT TCAGAAGCTA GAAGGTGCGG GGGGCTAGAA 2040 

TCCCTATACT GTAAAGAATG GGATTGTGAG ACCACGGGGA CCGGTTATTG GCTATCTAAA 2100 

TCCTCAAAAG ACCTCATAAC TGTAAAATGG GACCAAAATA GCGAATGGAC TCAAAAATTT 2160 

CAACAGTGTC ACCAGACCGG CTGGTGTAAC CCCCTTAAAA TAGATTTCAC AGACAAAGGA 2220 

AAATTATCCA AGGACTGGAT AACGGGAAAA ACCTGGGGAT TAAGATTCTA TGTGTCTGGA 2280 

CATCCAGGCG TACAGTTCAC CATTCGCTTA AAAATCACCA ACATGCCAGC TGTGGCAGTA 2340 

GGTCCTGACC TCGTCCTTGT GGAACAAGGA CCTCCTAGAA CGTCCCTCGC TCTCCCACCT 2400 

CCTCTTCCCC CAAGGGAAGC GCCACCGCCA TCTCTCCCCG ACTCTAACTC CACAGCCCTG 2460 

GCGACTAGTG CACAAACTCC CACGGTGAGA AAAACAATTG TTACCCTAAA CACTCCGCCT 2520 

CCCACCACAG GCGACAGACT TTTTGATCTT GTGCAGGGGG CCTTCCTAAC CTTAAATGCT 2580 

ACCAACCCAG GGGCCACTGA GTCTTGCTGG CTTTGTTTGG CCATGGGCCC CCCTTATTAT 2640 

GAAGCAATAG CCTCATCAGG AGAGGTCGCC TACTCCACCG ACCTTGACCG GTGCCGCTGG 2700 

GGGACCCAAG GAAAGCTCAC CCTCACTGAG GTCTCAGGAC ACGGGTTGTG CATAGGAAAG 2760 

GTGCCCTTTA CCCATCAGCA TCTCTGCAAT CAGACCCTAT CCATCAATTC CTCCGGAGAC 2820 

CATCAGTATC TGCTCCCCTC CAACCATAGC TGGTGGGCTT GCAGCACTGG CCTCACCCCT 2880 

TGCCTCTCCA CCTCAGTTTT TAATCAGACT AGAGATTTCT GTATCCAGGT CCAGCTGATT 2940 

CCTCGCATCT ATTACTATCC TGAAGAAGTT TTGTTACAGG CCTATGACAA TTCTCACCCC 3000 

AGGACTAAAA GAGAGGCTGT CTCACTTACC CTAGCTGTTT TACTGGGGTT GGGAATCACG 3060 

GCGGGAATAG GTACTGGTTC AACTGCCTTA ATTAAAGGAC CTATAGACCT CCAGCAAGGC 3120 

CTGACAAGCC TCCAGATCGC CATAGATGCT GACCTCCGGG CCCTCCAAGA CTCAGTCAGC 3180 

AAGTTAGAGG ACTCACTGAC TTCCCTGTCC GAGGTAGTGC TCCAAAATAG GAGAGGCCTT 3240 
GACTTGCTGT TTCTAAAAGA AGGTGGCCTC TGTGCGGCCC TAAAGGAAGA GTGCTGTTTT 3300 

TACATAGACC ACTCAGGTGC AGTACGGGAC TCCATGAAAA AACTCAAAGA AAAACTGGAT 3360 

AAAAGACAGT TAGAGCGCCA GAAAAGCCAA AACTGGTATG AAGGATGGTT CAATAACTCC 3420 

CCTTGGTTCA CTACCCTGCT ATCAACCATC GCTGGGCCCC TATTACTCCT CCTTCTGTTG 3480 

CTCATCCTCG GGCCATGCAT CATCAATCGA TTAGTTCAAT TTGTTAAAGA CAGGATCTCA 3540 

GTAGTCCAGG CTTTAGTCCT GACTCAACAA TACCACCAGC TAAAGCCTAT AGAGTACGAG 3600 

CCATAGGGCG CCTAGTGTTG ACAATTAATC ATCGGCATAG TATACGGCAT AGTATAATAC 3660 

GACTCACTAT AGGAGGGCCA CCATGGCCAA GTTGACCAGT GCCGTTCCGG TGCTCACCGC 3720 

GCGCGACGTC GCCGGAGCGG TCGAGTTCTG GACCGACCGG CTCGGGTTCT CCCGGGACTT 3780 

CGTGGAGGAC GACTTCGCCG GTGTGGTCCG GGACGACGTG ACCCTGTTCA TCAGCGCGGT 3840 

CCAGGACCAG GTGGTGCCGG ACAACACCCT GGCCTGGGTG TGGGTGCGCG GCCTGGACGA 3900 

GCTGTACGCC GAGTGGTCGG AGGTCGTGTC CACGAACTTC CGGGACGCCT CCGGGCCGGC 3960 

CATGACCGAG ATCGGCGAGC AGCCGTGGGG GCGGGAGTTC GCCCTGCGCG ACCCGGCCGG 4020 

CAACTGCGTG CACTTCGTGG CCGAGGAGCA GGACTGANNN NCGGACCGGT CGACTTGTTA 4080 
ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 4140 

ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT 4200 

ATCATGTCTG GATCCAGATC TGGGCCCATG CGGCCGCGGA TCGATNNNNA CATGTGAGCA 4260 

AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG 4320 

CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 4380 

ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 4440 

CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 4500 

TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC 4560 

TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 4620 

GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 4680 

AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC 4740 

TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 4800 

AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT 4860 

TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT 4920 

ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 4980 

TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 5040 

AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 5100 

TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 5160 

ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 5220 

TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 5280 

GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 5340 

AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 5400 

TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT 5460 

ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 5520 

AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 5580 

ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 5640 

TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 5700 

GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 5760 

CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 5820 

TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 5880 

AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 5940 

TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 6000 

TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 6060 

GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 6120 

CCCTTTCGTC TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 6180 

GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG 6240 

TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA 6300 

CTGAGAGTGC AC 6312 
CATATGCGGT GTGAAATACC GCACAGATGC 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA 
CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA 
TCCCAGTCAC GACGTTGTAA AACGACGGCC 
GACAGGATCT CAGTAGTCCA GGCTTTAGTC 
CTAGAATACG AGCCACAATA AATAAAAGAT 
GAAAGACCCC ACCAAATTGC TTAGCCTGAT 
GAAAAATACC AAACCAAGAA TAGAGAAGTT 
AACGTTGGGC CAAACAGGAT ATCTGCGGTG 
ACAGATGGTC ACCGCGGTTC GGCCCCGGCC 
GGCCCAACCC TCAGCAGTTT CTTAAGACCC 
TGAAATGACC CTGTGCCTTA TTTGAATTAA 
GCGCTTCTGC TTCCCGAGCT CTATAAAAGA 
CTCCGATAGA CTGAGTCGCC CGGGTACCCG 
CCGACTCGTG GTCTCGCTGT TCCTTGGGAG 
CGGGGGTCTT TCATTTGGGG GCTCGTCCGG 
ACCCACCACC GGGAGGTAAG CTGGCCAAGA 
TTTATGC-GGG ACCCCCCCCT TTGTCAACCT 
AGACTGATTT ACAAGCCCGA CTAAAAGGGC 
CCCTGGCCGA ATTGTACCGG CCAGGACATC 
ACTCCGTGTA CGTCCGGCGG CACCGCTCTC 
ACATCGTCCT GCTGACCACG CCCACCGCCA 
ACGCATCGCA CGCCAAGGCA GCCCCAAAAA 
AGCTCCGCCG TTCGGAGAAC CCTCTTAAGA 
ACCTTGTCCC TGTACTAACC CAAAATGAAA 
CTAATAATAG TTCGGGCAGG GTTTGACGAC 
CAACATGGTA AACCATGCGA ATGCAGCGGA 
ATCCAACAGG TAACTTGCCC AGGCAAGACG 
TGCAGAGTCA CTCCAAAAAT CTCACCTAGC 
ACTTTCCAGG ACTCGATGCA CAGTTCTTGT 
AATAAGACAT ACTACACGGC CACCTTGCTT 
CAGATATTAC AAAACCCCAA TCAGCTCCTA 
CCCGTTTGCT GGAGTGCCAC AGCCCCCATC 
ACTAAGAGAG TGTGGACAGT CCAAAAAAGG 
GAACTTCAAT ACCACCCCTT AGCCCTGCCC 
CGGACTTTTG ATATCCTGAA TACCACTTTT 
GCCCAAGATT GTTGGCTCTG TTTAAAACTA 
CCCTCTTTAA CCTACTCCCT AGCAGACTCC 
CCCCTCTTGG TTCAACCGAT GCAGTTCTCC 
AACGATACGG AACAAATAGA CTTAGGTGCA 
AATGTCAGTA GTCCTTTATG TGCCCTAAAC 
GCATACACCT ATTTACCCCA AAACTGGACC 
GACATTGACA TCAACCCGGG GGATGAGCCA 
CATAGACCTA AACGAGCTGT ACAGTTCATC 
GCATTCACCA CCGGAGCTAC AGGCCTAGGT 
CATCAGTTAA TATCTGAXGT CCAAGTCTTA 
GTAGACTCGT TAGCTGAAGT AGTTCTCCAA 
GAACAAGGAG GAATTTGTTT AGCCTTACAA 
GGAATTGTGA GAAACAAAAT AAGAACCCTA 
CTGGCAACCA ACCCTCTCTG GACCGGGCTG 
CTGGGACCCC TACTCACCCT CCTACTCATA 
CTCATGGCCT TCATTAATGA TAGACTTAAT 
TACCAAGCAC TCAAAGCTGA GGAAGAAGCT 
ATCATCGGCA TAGTATACGG CATAGTATAA 
CAAGTTGACC AGTGCCGTTC CGGTGCTCAC 
CTGGACCGAC CGGCTCGGGT TCTCCCGGGA 
CCGGGACGAC GTGACCCTGT TCATCAGCGC 
CCTGGCCTGG GTGTGGGTGC GCGGCCTGGA 
GTCCACGAAC TTCCGGGACG CCTCCGGGCC 
GGGGCGGGAG TTCGCCCTGC GCGACCCGGC 
GCAGGACTGA NNNNCGGACC GGTCGACTTG 
ACAAATAAAG CAATAGCATC ACAAATTTCA 
GTTGTGGTTT GTCCAAACTC ATCAATGTAT 
ATGCGGCCGC GGATCGATNN NNACATGTGA 
AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT 
AATCGACGCT CAAGTCAGAG GTGGCGAAAC 
CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT 
TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 



GTAAGGAGAA AATACCGCAT 
GGGCGATCGG TGCGGGCCTC 
AGGCGATTAA GTTGGGTAAC 
AGTGAATTCC GATTAGTTCA 
CTGACTCAAC AATACCACCA 
TTTATTTAGT TTCCAGAAAA 
AGCCGCAGTA ACGCCATTTT 
CAGATCAAGG GCGGGTACAC 
AGCAGTTTCG GCCCCGGCCC 
CGGGGCCAAG AACAGATGGT 
ATCAGATGTT TCCAGGCTCC 
CCAATCAGCC TGCTTCTCGC 
GCTCACAACC CCTCACTCGG 
TGTATCCAAT AAATCCTCTT 
GGTCTCCTCA GAGTGATTGA 
GATCTGGAGA CCCCTGCCCA 
TCCCCCGGGC TGCAGGAATT 
TGCTCAATTC CTTCTCCCCC 
TGCAAGGCGT GCAGGCCCAA 
CACAAACTAG CCACCCATTT 
AAGGATTGGA GCCTCGTTGG 
TAAAGGTTGA CGGGATCGCC 
CCCCTGGACC AGAAACTCCC 
TAAGACTCTC CCGTGTCTGA 
CTCCCAACAG GAATGGTCAT 
CCCCGCAAGG CTATCGCATT 
GGGCAGGTAT CCGAGGCCCC 
GCCTACTTAA TGACCAACCA 
GGGGGAGAAC TCCAGAACTG 
TATACTGAAT ACCGGCAATG 
AAAATACGGT CTGGGAGCCT 
CAGTCCCCTT GTAGGGGCTC 
CATATCTCCG ATGGTGGAGG 
CTAGAACAAA TTCATAAGGC 
AAAGTCAGAG ATGACCTTAG 
AGGTTACTCC AGATGTCCAA 
GGTACCCCTA CCCCTCTTGC 
CTAGCGAATG CCTCCTGTCA 
AACTCGTCCT GTTTATCTTC 
GTCACCTTTA CTAACTGCAC 
GGGTCAGTCT TCCTCTGTGG 
AGACTTTGCG TCCAAGCCTC 
GTCCCCATTC CTGCCATTGA 
CCTTTACTAG CTGGACTGGG 
GTCTCCGTCA CCCAGTATAC 
TCCGGTACCA TACAAGATTT 
AATAGGAGGG GACTGGACCT 
GAAAAATGCT GTTTTTATGC 
CAAGAAGAAT TACAAAAACG 
CAGGGCTTTC TTCCGTACCT 
CTAACCATTG GGCCATGCGT 
GTTGTACATG CCATGGTGCT 
CAGGATTGAG GCGCCTAGTG 
TACGACTCAC TATAGGAGGG 
CGCGCGCGAC GTCGCCGGAG 
CTTCGTGGAG GACGACTTCG 
GGTCCAGGAC CAGGTGGTGC 
CGAGCTGTAC GCCGAGTGGT 
GGCCATGACC GAGATCGGCG 
CGGCAACTGC GTGCACTTCG 
TTAACTTGTT TATTGCAGCT 
CAAATAAAGC ATTTTTTTCA 
CTTATCATGT CTGGATCCAG 
GCAAAAGGCC AGCAAAAGGC 
AGGCTCCGCC CCCCTGACGA 
CCGACAGGAC TATAAAGATA 
GTTCCGACCC TGCCGCTTAC 
CTTTCTCAAT GCTCACGCTG 



CAGGCGCCAT 


60 


TTCGCTATTA 


120 


GCCAGGGTTT 


180 


ATTTGTTAAA 


240 


GCTAAAACCA 


300 


AGGGGGGAAT 


360 


GCAAGGCATG 


420 


GAAAACAGCT 


480 


GGGGCCAAGA 


540 


CCCCAGATAT 


600 


CCCAAGGACC 


660 


TTCTGTTCGC 


720 
GCTGTTGCAT 
CTACCCGTCT 
TATGAAATCC 
1260 
ACCCCTCGAT 
TATGACTCCT 


2040 


CCTTGATGCA 


2100 


TTTTAGCCTT 


2160 


GATACCCACT 


2220 


GATTATACCT 


2280 


CCCTTTCATT 


2340 


CTCTGTAGCC 


2400 
2520 


TCATTATATA 


2580 


AATCACCGCA 
2700 
CCGGTGTGGT 


3360 


CGGACAACAC 


3420 


CGGAGGTCGT 


3480 


AGCAGCCGTG 


3540 


TGGCCGAGGA 


3600 


TATAATGGTT 


3660 


CTGCATTCTA 


3720 


ATCTGGGCCC 


3780 


CAGGAACCGT 


3840 


GCATCACAAA 


3900 


CCAGGCGTTT 


3960 


CGGATACCTG 


4020 


TAGGTATCTC 


4080 
AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC 4140 
GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 4200 

TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT 4260 

ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 4320 

TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA 4380 

CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA 4440 

AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA 4500 

AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT 4560 

TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 4620 

AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCGTTCATCC 4680 

ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC 4740 

CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA 4800 

AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC 4860 

CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 4920 

AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA 4980 

TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 5040 

GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 5100 

CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 5160 

TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT 5220 

TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG 5280 

CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA 5340 

TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 5400 

AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 5460 

ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG 5520 

GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 5580 

GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG 5640 

ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC GTCTCGCGCG TTTCGGTGAT 5700 

GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG TCACAGCTTG TCTGTAAGCG 5760 

GATGCCGGGA GCAGACAAGC CCGTCAGGGC GCGTCAGCGG GTGTTGGCGG GTGTCGGGGC 5820 

TGGCTTAACT ATGCGGCATC AGAGCAGATT GTACTGAGAG TGCAC 5865 
AGATCTCCCG ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA 60 

AGCCAGTATC TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT 120 

TAAGCTACAA CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG 180 

CGTTTTGCGC TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT 240 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 300 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 360 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 420 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 480 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 540 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 600 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 660 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 720 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 780 

CGGTGGGAGG TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTAACTG 840 

GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTG GGGACCCTTG ATTGTTCTTT 900 

CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC AAAGTTTTCA GGGTGTTGTT 960 

TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA TGATAATTTT GTTTCTTTCA 1020 

CTTTCTACTC TGTTGACAAC CATTGTCTCC TCTTATTTTC TTTTCATTTT CTGTAACTTT 1080 

TTCGTTAAAC TTTAGCTTGC ATTTGTAACG AATTTTTAAA TTCACTTTTG TTTATTTGTC 1140 

AGATTGTAAG TACTTTCTCT AATCACTTTT TTTTCAAGGC AATCAGGGTA TATTATATTG 1200 

TACTTCAGCA CAGTTTTAGA GAACAATTGT TATAATTAAA TGATAAGGTA GAATATTTCT 1260 

GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT AGAAACAACT ACATCCTGGT 1320 

CATCATCCTG CCTTTCTCTT TATGGTTACA ATGATATACA CTGTTTGAGA TGAGGATAAA 1380 

ATACTCTGAG TCCAAACCGG GCCCCTCTGC TAACCATGTT CATGCCTTCT TCTTTTTCCT 1440 

ACAGCTCCTG GGCAACGTGC TGGTTGTTGT GCTGTCTCAT CATTTTGGCA AGGATCGGCC 1500 

GGAACAGCAT CAGGACCGAC ATGGAAGGTC CAGCGTTCTC AAAACCCCTT AAAGATAAGA 1560 

TTAACCCGTG GAAGTCCTTA ATGGTCATGG GGGTCTATTT AAGAGTAGGG ATGGCAGAGA 1620 

GCCCCCATCA GGTCTTTAAT GTAACCTGGA GAGTCACCAA CCTGATGACT GGGCGTACCG 1680 

CCAATGCCAC CTCCCTTTTA GGAACTGTAC AAGATGCCTT CCCAAGATTA TATTTTGATC 1740 

TATGTGATCT GGTCGGAGAA GAGTGGGACC CTTCAGACCA GGAACCATAT GTCGGGTATG 1800 

GCTGCAAATA CCCCGGAGGG AGAAAGCGGA CCCGGACTTT TGACTTTTAC GTGTGCCCTG 1860 

GGCATACCGT AAAATCGGGG TGTGGGGGGC CAAGAGAGGG CTACTGTGGT GAATGGGGTT 1920 

GTGAAACCAC CGGACAGGCT TACTGGAAGC CCACATCATC ATGGGACCTA ATCTCCCTTA 1980 

AGCGCGGTAA CACCCCCTGG GACACGGGAT GCTCCAAAAT GGCTTGTGGC CCCTGCTACG 2040 

ACCTCTCCAA AGTATCCAAT TCCTTCCAAG GGGCTACTCG AGGGGGCAGA TGCAACCCTC 2100 

TAGTCCTAGA ATTCACTGAT GCAGGAAAAA AGGCTAATTG GGACGGGCCC AAATCGTGGG 2160 

GACTGAGACT GTACCGGACA GGAACAGATC CTATTACCAT GTTCTCCCTG ACCCGCCAGG 2220 

TCCTCAATAT AGGGCCCCGC ATCCCCATTG GGCCTAATCC CGTGATCACT GGTCAACTAC 2280 

CCCCCTCCCG ACCCGTGCAG ATCAGGCTCC CCAGGCCTCC TCAGCCTCCT CCTACAGGCG 2340 

CAGCCTCTAT AGTCCCTGAG ACTGCCCCAC CTTCTCAACA ACCTGGGACG GGAGACAGGC 2400 

TGCTAAACCT GGTAGAAGGA GCCTATCAGG CGCTTAACCT CACCAATCCC GACAAGACCC 2460 

AAGAATGTTG GCTGTGCTTA GTGTCGGGAC CTCCTTATTA CGAAGGAGTA GCGGTCGTGG 2520 

GCACTTATAC CAATCATTCT ACCGCCCCGG CCAGCTGTAC GGCCACTTCC CAACATAAGC 2580 

TTACCCTATC TGAAGTGACA GGACAGGGCC TATGCATGGG AGCACTACCT AAAACTCACC 2640 

AGGCCTTATG TAACACCACC CAAAGTGCCG GCTCAGGATC CTACTACCTT GCAGCACCCG 2700 

CTGGAACAAT GTGGGCTTGT AGCACTGGAT TGACTCCCTG CTTGTCCACC ACGATGCTCA 2760 

ATCTAACCAC AGACTATTGT GTATTAGTTG AGCTCTGGCC CAGAATAATT TACCACTCCC 2820 

CCGATTATAT GTATGGTCAG CTTGAACAGC GTACCAAATA TAAGAGGGAG CCAGTATCGT 2880 

TGACCCTGGC CCTTCTGCTA GGAGGATTAA CCATGGGAGG GATTGCAGCT GGAATAGGGA 2940 

CGGGGACCAC TGCCCTAATC AAAACCCAGC AGTTTGAGCA GCTTCACGCC GCTATCCAGA 3000 

CAGACCTCAA CGAAGTCGAA AAATCAATTA CCAACCTAGA AAAGTCACTG ACCTCGTTGT 3060 

CTGAAGTAGT CCTACAGAAC CGAAGAGGCC TAGATTTGCT CTTCCTAAAA GAGGGAGGTC 3120 

TCTGCGCAGC CCTAAAAGAA GAATGTTGTT TTTATGCAGA CCACACGGGA CTAGTGAGAG 3180 

ACAGCATGGC CAAACTAAGG GAAAGGCTTA ATCAGAGACA AAAACTATTT GAGTCAGGCC 3240 

AAGGTTGGTT CGAAGGGCAG TTTAATAGAT CCCCCTGGTT TACCACCTTA ATCTCCACCA 3300 

TCATGGGACC TCTAATAGTA CTCTTACTGA TCTTACTCTT TGGACCCTGC ATTCTCAATC 3360 

GATTAGTTCA ATTTGTTAAA GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC 3420 

AATACCACCA GCTAAAGCCT ATAGAGTACG AGCCATAGGG CGCCTAGTGT TGACAATTAA 3480 

TCATCGGCAT AGTATACGGC ATAGTATAAT ACGACTCACT ATAGGAGGGC CACCATGGCC 3540 

AAGTTGACCA GTGCCGTTCC GGTGCTCACC GCGCGCGACG TCGCCGGAGC GGTCGAGTTC 3600 

TGGACCGACC GGCTCGGGTT CTCCCGGGAC TTCGTGGAGG ACGACTTCGC CGGTGTGGTC 3660 

CGGGACGACG TGACCCTGTT CATCAGCGCG GTCCAGGACC AGGTGGTGCC GGACAACACC 3720 

CTGGCCTGGG TGTGGGTGCG CGGCCTGGAC GAGCTGTACG CCGAGTGGTC GGAGGTCGTG 3780 

TCCACGAACT TCCGGGACGC CTCCGGGCCG GCCATGACCG AGATCGGCGA GCAGCCGTGG 3840 

GGGCGGGAGT TCGCCCTGCG CGACCCGGCC GGCAACTGCG TGCACTTCGT GGCCGAGGAG 3900 

CAGGACTGAN NNNCGGACCG GTCGA 3925 



INTERNATIONAL SEARCH REPORT 



International Application No 

PCT/GB 96/02061 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/86 C12N5/10 C12N15/67 




According to International Patent Classification (IPC) or to both national classification and IPC 




B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12N 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category * 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


JOURNAL OF VIROLOGY 69 (7). 1995. 
4086-4094. ISSN: 0022-538X, 
July 1995, XP002023654 
LUUKKONEN B G M. ET AL: "Efficiency of 
reinitiation of translation on human 
immunodeficiency virus type 1 mRNAs is 
determined by the length of the upstream 
open reading frame and by intercistronic 
distance." 

see the whole document 


1-29 


A 


VIROLOGY (1995), 208(1), 215-25 CODEN: 
VIRLAX; ISSN: 0042-6822, 

1 April 1995, XP002023655 
HERZOG, ETIENNE ET AL: "Translation of 
the second gene of peanut clump virus RNA 
2 occurs by leaky scanning in vitro" 
see the whole document 

-/-- 


1-29 


X Further documents are listed in the continuation of box C. 




X Patent family members are listed in annex. 


* Special categories of cited documents: 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 

"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such documents, 
such combination being obvious to a person skilled 
in the art 

"&" document member of the same patent family 


Date of the actual completion of the international search 




Date of mailing of the international search report 


23 January 1997 




12.02.97 




Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax (+31-70) 340-3016 


Authorized officer 

Hornig, H 



Form PCT/ISA/210 (second sheet) (July 1992) 






C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category * 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 



J. VIROL. (1993), 67(8), 4886-95 CODEN: 
JOVIAM; ISSN: 0022-538X, 
August 1993, XP000616337 
FOUILLOT, NATHALIE ET AL: "Translation of 
the hepatitis B virus P gene by ribosomal 
scanning as an alternative to internal 
initiation" 
see the whole document 


1-29 


VIROLOGY, 
vol. 188, no. 1, May 1992, ACADEMIC PRESS, 
INC., NEW YORK, US, 
pages 342-352, XP002023656 
C.-G. LIN AND S.J. LO: "Evidence for 
involvement of a ribosomal leaky scanning 
mechanism in the translation of the 
hepatitis B virus Pol gene from the viral 
pregenome RNA" 
see the whole document 


1-29 


VIROLOGY, 
vol. 185, no. 2, December 1991, ACADEMIC 
PRESS, INC., NEW YORK, US, 
pages 862-866, XP000616129 
F.-L. COSSET ET AL.: "Newcastle disease 
virus (NDV) vaccine based on immunization 
with avian cells expressing the NDV 
hemagglutinin-neuraminidase glycoprotein" 
cited in the application 
see the whole document 


1-29 


MOL. CELL. BIOL., 
vol. 7, no. 10, October 1987, ASM 
WASHINGTON, DC, US, 
pages 3438-3445, XP000616288 
M. KOZAK: "Effects of intercistronic 
length on the efficiency of reinitiation 
by eucaryotic ribosomes" 
cited in the application 
see the whole document 


1-29 


EXPERIMENTAL CELL RESEARCH, 
vol. 197, no. 2, December 1991, ACADEMIC 
PRESS INC., NEW YORK, US, 
pages 229-233, XP000616323 
M. IZUMI ET AL.: "Blasticidin 
S-resistance gene (bsr): a novel 
selectable marker for mammalian cells" 
cited in the application 
see the whole document 


1-29 





A 


NATURE (LONDON) (1984), 309(5963), 82-5 
CODEN: NATUAS;ISSN: 0028-0836, 
3 May 1984, XP002023657 
LIU, CHUNG CHENG ET AL: "Initiation of 
translation at internal AUG codons in 
mammalian cells" 
see the whole document 




1-29 


A 


WO, A, 94 24870 (BIOTRANSPLANT INC ;GEN 
HOSPITAL CORP (US); LE GUERN CHRISTIAN A 
(U) 10 November 1994 
see the whole document 




1-29 


A 


WO, A, 93 03143 (ANDERSON W FRENCH ;MORGAN 
RICHARD A (US); COUTURE LARRY (US)) 18 
February 1993 
see the whole document 




1-29 


A 


WO, A, 94 23048 (US HEALTH ;EIDEN MARYBETH V 
(US); WILSON CAROLYN A (US); DEACON NI) 13 
October 1994 
see the whole document 




1-29 


A 


PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF USA, 

vol. 85, no. 17, 1 September 1988, 

pages 6460-6464, XP000569693 

DANOS O ET AL: "SAFE AND EFFICIENT 

GENERATION OF RECOMBINANT RETROVIRUSES 

WITH AMPHOTROPIC AND ECOTROPIC HOST 

RANGES" 

see the whole document 




1-29 


P,X 


J. VIROL. (1995), 69(12), 7430-6 CODEN: 
JOVIAM; ISSN: 0022-538X, 
December 1995, XP000569527 
COSSET, FRANCOIS-LOIC ET AL: "High-titer 
packaging cells producing recombinant 
retroviruses resistant to human serum" 
see the whole document 




1-29 






INTERNATIONAL SEARCH REPORT 

Information on patent family members 


International Application No 

PCT/GB 96/02061 


Patent document          Publication   Patent family     Publication 
cited in search report   date          member(s)         date 

WO-A-9424870             10-11-94      AU-A- 623G394     21-11-94 
                                       CA-A- 2162G56     10-11-94 
                                       EP-A- 0706319     17-04-96 

WO-A-9303143             18-02-93      CA-A- 2114416     18-02-93 
                                       EP-A- 0598029     25-05-94 
                                       JP-T- 6509713     02-11-94 

WO-A-9423048             13-10-94      AU-A- 6903194     24-10-94 
                                       CA-A- 2160034     13-10-94 
                                       EP-A- 0699240     06-03-96 
                                       JP-T- 8511156     26-11-96 



Form PCT/ISA/210 (patent family annex) (July 1992) 



